| Author |
Message |
![]() ![]() ![]() ![]()
Ballasticmissile
Hero Username: Ballasticmissile
Post Number: 10643 Registered: 07-2012 Posted From: 45.56.149.92
Rating: N/A Votes: 0 (Vote!) | | Posted on Tuesday, August 28, 2018 - 06:55 am: |
![]() ![]() ![]() ![]() ![]() |
https://www.google.com/search?rlz=1C9BKJA_enDE766DE766&hl=en -US&ei=yS2FW-akH7C7ggfM06vABA&q=stock+market+&oq=stock+marke t+&gs_l=mobile-gws-wiz-serp.3..0i131i67j0i67l4.1663.1663..38 05...0.0..0.168.168.0j1......0....1.........0i71.lPpNdcxlF-Y Capacity vundi, laziness, and uninspired life is a waste of time. YOLO kada.... But experiences is how you bring meaning to life. Worthiness should be earned with adequate efforts. |
![]() ![]() ![]() ![]()
Inquisitive
Side Hero Username: Inquisitive
Post Number: 2467 Registered: 09-2014 Posted From: 183.83.206.51
Rating: N/A Votes: 0 (Vote!) | | Posted on Wednesday, August 15, 2018 - 09:06 pm: |
![]() ![]() ![]() ![]() ![]() |
Kindal:Inspected IMDB dataset that has 58788 movies from year 1893 to 2005. Strangely, with increase in budget of the movies, the rating are declining. Pearson correlation coefficient: -0.01422905 Though, correlation doesn't lead to causation and requires further investigation, this is an unexpected result.
Has the budget been adjusted for inflation? If not, the result is obvious. Amongst the older movies, some of the bad ones won't even be listed on IMDb. So you'll notice a lot of high rating movies amongst the old ones. Because of this you'll get the feeling that lower budget => higher rating. Truth is that older movies have higher ratings. So budget is indirectly acting as a proxy for how new the movie is. "Sakshi is a most balanced and independent media. This has no affiliation with any political party," Jagan had said. Link: http://www.outlookindia.com/news/article/sakshi-retelecasts- story-omits-antisonia-remarks/701963
|
![]() ![]() ![]() ![]()
Chanakya
Comedian Username: Chanakya
Post Number: 1619 Registered: 04-2015 Posted From: 115.248.37.69
Rating: N/A Votes: 0 (Vote!) | | Posted on Wednesday, August 19, 2015 - 08:44 am: |
![]() ![]() ![]() ![]() ![]() |
Kindal:budget, length(numeric), genere & certification - mpaa (categorical variable) of the movie are independent variables , where as votes and rating are dependent variables. I always run the summary of the dataset, understand the basic characteristics and then run the tests. Beauty with R is, one can write own packages to do these things or use the existing ones. :-) Graphical representation of data is vital for data analysis and you should understand the variables better and gives a sense of how they are affecting each other. In this case you can do scatter plot for budget, length, votes and ratings. Using, categorical data in visualising is key skill and gives great insights. Coming to performance, it's at the discretion of the analyst. I handled the data between 200-400k records on desktop without any issues. If data is huge, do random sampling (many functions exist) and extract 2000-3000 records. Analyse them and unless you are missing something critical, you can generalise these findings on the population.
thanQ for the explanation  You have your way. I have my way. As for the right way, the correct way, and the only way, it does not exist. ~Friedrich Nietzsche |
![]() ![]() ![]() ![]()
Chanakya
Comedian Username: Chanakya
Post Number: 1618 Registered: 04-2015 Posted From: 115.248.37.69
Rating: N/A Votes: 0 (Vote!) | | Posted on Wednesday, August 19, 2015 - 08:44 am: |
![]() ![]() ![]() ![]() ![]() |
~chirutha~: Caste vs behaviour patterns, Political party vs Corruptions, Govt Departments vs fruitful schemes etc
Genuine Data unte why not  You have your way. I have my way. As for the right way, the correct way, and the only way, it does not exist. ~Friedrich Nietzsche |
![]() ![]() ![]() ![]()
~chirutha~
Side Hero Username: ~chirutha~
Post Number: 5290 Registered: 10-2011 Posted From: 192.109.190.88
Rating: N/A Votes: 0 (Vote!) | | Posted on Wednesday, August 19, 2015 - 08:40 am: |
![]() ![]() ![]() ![]() ![]() |
Kindal:
Idantha kadu kani, do an analysis on topics like Caste vs behaviour patterns, Political party vs Corruptions, Govt Departments vs fruitful schemes etc and come here  Be Kool  |
![]() ![]() ![]() ![]()
Kindal
Side Hero Username: Kindal
Post Number: 2711 Registered: 09-2007 Posted From: 14.139.128.21
Rating: N/A Votes: 0 (Vote!) | | Posted on Wednesday, August 19, 2015 - 08:07 am: |
![]() ![]() ![]() ![]() ![]() |
Chanakya:
budget, length(numeric), genere & certification - mpaa (categorical variable) of the movie are independent variables , where as votes and rating are dependent variables. I always run the summary of the dataset, understand the basic characteristics and then run the tests. Beauty with R is, one can write own packages to do these things or use the existing ones. Graphical representation of data is vital for data analysis and you should understand the variables better and gives a sense of how they are affecting each other. In this case you can do scatter plot for budget, length, votes and ratings. Using, categorical data in visualising is key skill and gives great insights. Coming to performance, it's at the discretion of the analyst. I handled the data between 200-400k records on desktop without any issues. If data is huge, do random sampling (many functions exist) and extract 2000-3000 records. Analyse them and unless you are missing something critical, you can generalise these findings on the population. !ntfi |
![]() ![]() ![]() ![]()
Chanakya
Comedian Username: Chanakya
Post Number: 1617 Registered: 04-2015 Posted From: 115.248.37.69
Rating: N/A Votes: 0 (Vote!) | | Posted on Wednesday, August 19, 2015 - 07:48 am: |
![]() ![]() ![]() ![]() ![]() |
Kindal:You are right. I misread it as -0.14 and thought at such high number of ratings (37161681) and 58788 # of movies, would explain some amount of this relationship. With pearson correlation coefficient of -0.014, the relationship is non-existent. Budget and rating are not related. To some extent, increased budget leads increased votes or viceversa. Budget of the movie and # of votes has got nothing to do with the rating of the movie.
Okies! Also how many variables did you use? I can see its huge data, i never thought or tested R performance angle in terms of processing huge data. if you dont mind me questioning ( i know you wouldnt ) - do you draw scatter plots between all variables (atleast which you think might be related) ? You have your way. I have my way. As for the right way, the correct way, and the only way, it does not exist. ~Friedrich Nietzsche |
![]() ![]() ![]() ![]()
Kindal
Side Hero Username: Kindal
Post Number: 2709 Registered: 09-2007 Posted From: 14.139.128.21
Rating: N/A Votes: 0 (Vote!) | | Posted on Wednesday, August 19, 2015 - 07:43 am: |
![]() ![]() ![]() ![]() ![]() |
Chanakya: anything different with Pearson coefficient than the one we get in R? here -0.014 should be read as "very weak relation" between budget & Rating?
You are right. I misread it as -0.14 and thought at such high number of ratings (37161681) and 58788 # of movies, would explain some amount of this relationship. With pearson correlation coefficient of -0.014, the relationship is non-existent. Budget and rating are not related. To some extent, increased budget leads increased votes or viceversa. Budget of the movie and # of votes has got nothing to do with the rating of the movie. !ntfi |
![]() ![]() ![]() ![]()
Chanakya
Comedian Username: Chanakya
Post Number: 1616 Registered: 04-2015 Posted From: 115.248.37.69
Rating: N/A Votes: 0 (Vote!) | | Posted on Wednesday, August 19, 2015 - 07:27 am: |
![]() ![]() ![]() ![]() ![]() |
Kindal:Pearson correlation coefficient: -0.01422905
anything different with Pearson coefficient than the one we get in R? here -0.014 should be read as "very weak relation" between budget & Rating? You have your way. I have my way. As for the right way, the correct way, and the only way, it does not exist. ~Friedrich Nietzsche |
![]() ![]() ![]() ![]()
Kindal
Side Hero Username: Kindal
Post Number: 2708 Registered: 09-2007 Posted From: 14.139.128.21
Rating: N/A Votes: 0 (Vote!) | | Posted on Wednesday, August 19, 2015 - 07:21 am: |
![]() ![]() ![]() ![]() ![]() |
Chanakya:what about "number of votes" vs "movie rating" how are these affecting one another?
# of votes vs rating: 0.1037069 (Very weakly correlated or we can ignore this relationship) # votes vs budget : 0.4412935 (Weakly or near to moderately correlated). Seems, increased budget will increase number of people to rate the movies, and thus increased voting is resulting in reduced ratings for the movie. !ntfi |
![]() ![]() ![]() ![]()
Chanakya
Comedian Username: Chanakya
Post Number: 1614 Registered: 04-2015 Posted From: 115.248.37.69
Rating: N/A Votes: 0 (Vote!) | | Posted on Wednesday, August 19, 2015 - 07:01 am: |
![]() ![]() ![]() ![]() ![]() |
what about "number of votes" vs "movie rating" how are these affecting one another? You have your way. I have my way. As for the right way, the correct way, and the only way, it does not exist. ~Friedrich Nietzsche |
![]() ![]() ![]() ![]()
Kindal
Side Hero Username: Kindal
Post Number: 2706 Registered: 09-2007 Posted From: 14.139.128.20
Rating: N/A Votes: 0 (Vote!) | | Posted on Tuesday, August 18, 2015 - 07:26 pm: |
![]() ![]() ![]() ![]() ![]() |
Inspected IMDB dataset that has 58788 movies from year 1893 to 2005. Strangely, with increase in budget of the movies, the rating are declining. Pearson correlation coefficient: -0.01422905 Though, correlation doesn't lead to causation and requires further investigation, this is an unexpected result. !ntfi |
|