STA258 Lecture 24
- Some of the questions from previous finals.
- 1
- a:
- Youtuber MrBeast.
- Challenge videos as well as philanthropy.
- Is there a difference between the subscription rates between the two types of videos?
- Group A: Challenge videos.
- Group B: Philanthropy videos.
- First 48 hours of the release.
- Hypotheses:
- Group A:
- Group B:
- Values might not be right hard to see.
- Since it's a sample, we need to calculate the sample means and standard deviations.
- It also means we can't use we need to use or something of that sort.
-
-
- Suppose sample variances are equal, we can pool them together.
- Formula:
- In R
2*pt(0.09, df=8, lower.tail=FALSE) gives
- , so we fail to reject the null hypothesis.
-
- is very small, so we reject the null hypothesis.
- Evidence suggests that the subscription rates are different between the two types of videos.
- b:
- Let be a set of data points.
- Regression model: for
- is the predicted value of based on the regression model.
- Find the least squares estimates and .
- Consider using .
-
- Group A:
- Group B:
-
-
- We want to minimize
- Differentiate with respect to and and set to 0.
-
- 9
- Analyse mrbeasts videos
-
- a:
- Provide interpretations of both the intercept and slope of the best model for the data.
- Are there any issues with the model?
- For every log unit increase in view count, we expect the log like count to increase by 0.99 on average.
- b:
- Interpret the coefficient of determination for this model.
- We have to look at the value for this model.
- This tells us that of the uncertainty in the log like count can be explained by the log view count.
- tells us that there is a strong positive linear relationship between log view count and log like count.
- c:
- Use the model to predict the number of likes for a ivdeo with views.
- So we would expect a video with views to have approximately likes.