(08-25-2022 06:53 PM)bryanw1995 Wrote: (08-25-2022 05:53 PM)e-parade Wrote: (08-25-2022 04:59 PM)bullet Wrote: (08-25-2022 02:40 PM)Yosef181 Wrote: (08-25-2022 02:28 PM)bullet Wrote: Do you not understand the concept of "lies, damn lies and statistics?" You clearly don't understand the article and may not have even read it. Taking one school's best 27 games and comparing it to someone else's 37 games gives you a meaningless result.
There's no "27 best games" against "someone else's 37". One school has 27 data records. Another has 37 data records. That's the complete dataset. It's beyond ridiculous to assume that if you added 10 App games, they would automatically be the 10 lowest App games. That's speculation, and has no basis in the reality of this dataset.
Data "lies" when bias and tampering is introduced. One example of that is to intentionally leave out 10 records you don't like. One way to lessen the effect of bias and tampering is to look at the most complete dataset possible.
The actual, real conclusion, based on the complete dataset: Appalachian State averages more viewers (0.587mm) than East Carolina (0.547mm).
Could that change if more App games were on TV? The only way to find out is to put more App games on TV.
Read the article. You might then understand. As pointed out above, if you use Vanderbilt's 2 data points in 2018, they average better than Miami (FL) does on their 11. All schools have declining curves on their ratings as shown in Miami's curve earlier in the thread.
Of course they do, the curves are showing the games in order of highest viewership to lowest. It's impossible to show anything BUT a decline. The assumption you're making is that the more games you add the lower the viewership becomes, but that's not correct. That would be the case if all the games were in the same season and you plotted them by order of when they happened and it showed a decline, but that's not at all what the data is doing. Dates are irrelevant and it's simply most to least. Ohio State could have a game tomorrow that's their highest rated game. More likely it'll fall somewhere in the middle of their curve though, but it's not a case where adding an additional game will make the curve decline more.
The issue here is that what you're doing is assuming that an additional 10 games for App State would be at the end of the curve as opposed to in the middle of the curve, or even at the beginning of the curve (meaning: any 10 additional games could be their highest viewed one, their lowest viewed one, or somewhere else in the middle).
This is statistical analysis, and therefore you need decent sample sizes to extrapolate what games outside of the given data set will look like. A sample of 2 is not big enough (same with a sample of 6 from my own UMass). A sample of 27 compared to a sample of 37 is much more reasonable. It's not perfect, yeah, but it does a pretty good job at showing a trend.
He is completely correct that you can't just take the lowest 10 and remove them from the East Carolina dataset for a comparison. What you would need to do is randomly remove 10 of them if you want the same number of games to compare against. It should end up showing about what the average is showing now because of how probability works, but it could also make the average higher or lower (but it won't change much if it was truly randomly selected).
Actually that is incorrect. One school only had 27 games that were broadcast at all, the other had 37. You would have to look at the next 10 best games from the 27 member school, you could either make an educated guess on those, use 0, or just take top 27 from both of them to compare.
Yosef is biased b/c he went to one of the schools. I don't blame him for it, I was trying to figure out a way to get A&M ahead of bama, or at least Auburn or LSU, but I couldn't abuse the numbers coherently enough to make it work.
That's not how statistics or ratings work. If one has a sample size of 27 because only 27 games were aired, then you cannot simply create additional games with 0 ratings. If there were no additional games that could be counted then it's your full data set.
Edit here: also, the "educated guess" you'd use for the next 10 games would be the average of the previous 27 games. It's the only reasonable educated guess you could use because the expectation is you'd perform at your average over the long run.
You can't take a TV show that has 10 episode seasons and compare it to a 14 episode season by saying "one of them had only 10 episodes, so we average in 4 0s to compare which did better" nor can you say "we compare only the top 10 to the entire 10." If the 10 episode show averaged 1.5 million viewers per episode and the 14 episode show averaged 1.3 per episode, then the 10 episode show had better average ratings than the 14 episode one. The expectation is that those ratings would continue at the average moving forward.
Now you can say the
total viewership was better for the one with more episodes, but there was also extra cost to producing those extra episodes. The bang for your buck is better with the one that had higher average ratings for 10 episodes.
And don't come back talking about ratings declining through the season and say it's the same for these games. That's not what the charts are showing for the games. Their data points are not in order of when they happened.
You also cannot simply remove the bottom 10 from the other for comparison purposes because, as I said before, this isn't a scenario when you add more games it results in lower ratings. The curve isn't declining further because there are additional games. It's declining because they're ordering it from best performance to worst performance. If you remove the bottom 10, you're no longer calculating an average of an entire portfolio, you're calculating an average of the highest performing parts of your portfolio.
If you want to compare top 10 to top 10, then that's one thing. But you also have to present it as an average of just the top 10s.
Two things can be true here:
1) App State had better
average viewership across its complete portfolio of aired games.
2) ECU had better
total viewership across its complete portfolio of games.
With the original statement being about averages, that's it. The statement was correct. There's no "truing up the data" to adjust for it. The data is the data. There's enough data from both schools for it to be a reasonably calculated average.
ECU had more eyeballs across all of their games. App state had more average eyeballs watching their games.