Teaspoon data
Mar. 1st, 2013 12:47 amI has it.
I am still navigating it. At the outset, I was curious about reviews. 87% of the stories on Teaspoon have at least one review (29732 out of 34145 stories). In comparison, 90% of the DW stories on FF.net have at least one review (40590 out of 44917).
There are 235262 reviews on Teaspoon (likely more now, as the scraping is a few hours old), and 550124 reviews on DW FF.net.
The 5-number summary of the number of reviews per story:
One thing that I have for Teaspoon and not for FF.net is the dates of the reviews. So I can look at trends in when reviews are given vs. publish dates. The 5-number summary of the number of days between the initial publish date and the date of a given review is:
But I also looked at the number of reviews vs last update time. Now, last update time can be misleading because authors tweak their stories without necessarily adding content (ETA: actually, I'm not sure now what an update means for a 1-chapter story. I've edited my stories quite a bit but the update dates seem to be static even if the edits happened months or years later. Mods? How does update vs. edit work?**). For example, 5085 stories have update dates not equal to publish dates, and only one chapter. On the other hand, at least for 1-chapter stories, 96% of the tweaks happen within 3 days of publication. So anyway, I'm looking at the 5-number summary of reviews after the last update time, for all reviews that are larger than the last update time:
The true distribution I'm searching for--the distribution of reviews following addition of a new chapter, is likely somewhere between those extremes.
No pictures yet; it's past my bedtime and I am meticulous about graphs.
*Yes, you saw that maximum correctly. There is a story (unslinky's 3-million word tome "Terminal Decay") that has more reviews than the number of words in most of the stories that most people write (!). Literally, 80% of the stories on Teaspoon are 4500 words or less.
**"Updated" is the last time updated, eg, sent through the modding queue. The update date does not change with story edits. The outliers for the 1-chapter stories seem to have at one point had other chapters or notes as chapters to "bump" them up in the archive, that got either consolidated into one chapter or deleted. Thanks
ghost2 and
abates for the clarification.
I am still navigating it. At the outset, I was curious about reviews. 87% of the stories on Teaspoon have at least one review (29732 out of 34145 stories). In comparison, 90% of the DW stories on FF.net have at least one review (40590 out of 44917).
There are 235262 reviews on Teaspoon (likely more now, as the scraping is a few hours old), and 550124 reviews on DW FF.net.
The 5-number summary of the number of reviews per story:
DW FF.net:
summary(story$reviews)
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.00 2.00 4.00 12.25 10.00 1710.00
Teaspoon:
summary(story$reviews)
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.000 1.000 3.000 6.888 6.000 4360.000*
On the whole it looks like Teaspooners may be stingier, but I think also that FF.net may have a higher probability of readers leaving negative reviews than Teaspoon. I'd have to take a sample and read them to find out. One thing that I have for Teaspoon and not for FF.net is the dates of the reviews. So I can look at trends in when reviews are given vs. publish dates. The 5-number summary of the number of days between the initial publish date and the date of a given review is:
summary(reviews$afterpub)
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.0 1.0 14.0 136.4 116.0 3451.0
So this is saying that 50% of the reviews that are in the database were given within two weeks of the initial publish date of the story. There is a big tail, but this also accounts for multi-chapter stories that are being updated in progress. I don't have the update times of the individual chapters. But I also looked at the number of reviews vs last update time. Now, last update time can be misleading because authors tweak their stories without necessarily adding content (ETA: actually, I'm not sure now what an update means for a 1-chapter story. I've edited my stories quite a bit but the update dates seem to be static even if the edits happened months or years later. Mods? How does update vs. edit work?**). For example, 5085 stories have update dates not equal to publish dates, and only one chapter. On the other hand, at least for 1-chapter stories, 96% of the tweaks happen within 3 days of publication. So anyway, I'm looking at the 5-number summary of reviews after the last update time, for all reviews that are larger than the last update time:
summary(reviews$afterupdate[reviews$afterupdate >= 0])
Min. 1st Qu. Median Mean 3rd Qu. Max.
0.0 0.0 1.0 160.2 134.0 3451.0
This is a weird distribution. 54% of these post-update reviews are given either 0, 1, or 2 days post update. The true distribution I'm searching for--the distribution of reviews following addition of a new chapter, is likely somewhere between those extremes.
No pictures yet; it's past my bedtime and I am meticulous about graphs.
*Yes, you saw that maximum correctly. There is a story (unslinky's 3-million word tome "Terminal Decay") that has more reviews than the number of words in most of the stories that most people write (!). Literally, 80% of the stories on Teaspoon are 4500 words or less.
**"Updated" is the last time updated, eg, sent through the modding queue. The update date does not change with story edits. The outliers for the 1-chapter stories seem to have at one point had other chapters or notes as chapters to "bump" them up in the archive, that got either consolidated into one chapter or deleted. Thanks
no subject
Date: 2013-03-01 02:12 pm (UTC)no subject
Date: 2013-03-01 02:25 pm (UTC)For example, here are some stories that have a difference in publish vs. update date greater than 60 days, but only one chapter.
id title words updated published complete 217 The End of All Things to Come 5392 12/27/03 08/21/03 0 674 Apollo Mission 100 08/28/06 11/05/03 1 916 The Calm Before the Massage 102 08/27/04 01/14/04 1 2663 Echoes in Time 3828 08/09/05 06/01/05 1 3345 Mate, Have you got the Time? 255 06/03/06 07/19/05 1 3947 Specks of Blue 135 11/11/05 08/28/05 1 3950 A Small Christmas Celebration 776 12/10/05 08/29/05 0 4347 Just What The Doctor Ordered 950 08/29/07 10/05/05 1 6082 Simulation of Life 503 07/19/06 04/06/06 1 10046 In CUM-pany 253 04/20/07 02/01/07 0 22577 Light Through a Glass Lens 1037 08/12/08 06/10/08 1 29564 Morals That Kill 3378 09/14/09 03/02/09 1 31128 You're Fired! 183 09/05/09 05/30/09 0 43468 Wedding Shower 3037 01/26/12 09/08/11 1 44619 Blast from the Past 3374 08/29/12 12/12/11 0 45204 A Mind Like No Other 7674 03/31/12 01/20/12 0no subject
Date: 2013-03-01 02:36 pm (UTC)no subject
Date: 2013-03-01 02:52 pm (UTC)As I noted, the discrepancy between publish and update (if there is a discrepancy) for 1-chapter stories is generally small. 5085 out of 26670 1-chapter stories have a discrepancy (19%), and only 48 have a discrepancy bigger than a week. But the top 5 have big discrepancies: from 9 months to two years.
A couple things that might be going on:
1) There was a server crash at some point, so there might have been some story text lost that was re-added. I've run across a few stories with word count = 1, and in the description is added "The text of this story was lost in the server crash."
2) It could depend on how the author might make edits. That is, if they want to make a change and instead upload a whole new chapter, and then delete the older one.
no subject
Date: 2013-03-01 03:00 pm (UTC)Your theory about the publish date being affected by how someone makes edits sounds likely. If I have time this weekend, maybe I'll look at the database to see if something obvious pops out to explain the date discrepancy.
no subject
Date: 2013-03-01 09:56 pm (UTC)no subject
Date: 2013-03-01 09:52 pm (UTC)"updated" is changed whenever a new chapter is added (in the case of Teaspoon, it's when the chapter is validated). If an author submits a one-chapter story and it's initially rejected, it may take some time to be revised and go back through the queue.
"updated" isn't updated when a story is edited, but bear in mind that at some point we upgraded from efiction 1.1 to efiction 2, and 1.1 may have handled it differently.
no subject
Date: 2013-03-01 04:31 pm (UTC)These are interesting statistics though...NEEDS A GRAPH *cackles*
no subject
Date: 2013-03-01 04:55 pm (UTC)I got an account over at FF.net but the posting interface was pretty awful so I never used it. I guess since I've been hanging around Teaspoon since 2006, I am more familiar with it. I don't mind AO3 but I never just go there and check the new stories queue, for example; I'm usually clicking a link someone has posted on LJ.
no subject
Date: 2013-03-01 05:16 pm (UTC)I love that you can load to AO3 with very little fuss and cleanup, Teaspoon is a copy and paste thing. FF.Net is a little harder (pasting warnings to Word Doc and uploading and tweaking - editing is a bitch, but doable), but I guess I'm so used to posting at all three places, I never really think on it. It takes a longer time to post, at FF.Net, I'll say that - but the reviews, favs and follows are worth the hassle. It has gotten to where Teaspoon is the last place I post, some of that due to it being moderated and I dread mod queues, lol!!
no subject
Date: 2013-03-01 06:42 pm (UTC)For me, I post to LJ, any relevant LJ comms, Teaspoon and then AO3. But a lot of the stuff I put up on AO3 was a big data dump of older stories. I've not written very much new since mid 2011 (that I've posted on any archive anyway). I think Teaspoon might have an advantage over FF.net and AO3 for classic Who stories. My Sixth Doctor stories haven't gotten very much attention at all on AO3, but they were well received on Teaspoon. There is a "first posted archive" effect and an "overall activity" effect in there, though, I bet.
no subject
Date: 2013-03-01 07:09 pm (UTC)I think when everyone gets AO3 they do a data dump, lol!! I have a bad habit of leaving the 'original posting date' on my stuffs, so not a lot of hits there. Then again, it seems on FF.Net is is MOSTLY my older fic that gets hits.
Oh yeah - Teaspoon is THE place for Classic. Readers AND writers. FF.Net and AO3 are mostly 'Nu Who'.
Interesting idea...wonder if there is data to support that.
no subject
Date: 2013-03-01 04:32 pm (UTC)no subject
Date: 2013-03-01 04:49 pm (UTC)no subject
Date: 2013-03-01 05:00 pm (UTC)no subject
Date: 2013-03-01 06:02 pm (UTC)Of course, this doesn't explain why some stories have long gaps between the published and updated dates, especially in the days prior to moderation. I have a couple of theories, though. One is that some authors used to post notes as new chapters in order to "bump" their stories onto the "most recent" page. If these notes were eventually deleted when moderation started, the irrelevant content would disappear, but a gap between the "published" and "updated" dates would remain. Or some authors may have chosen to condense multiple chapters of older stories into a single chapter, creating the same effect.
Also, possibly the system was run differently prior to a certain date so that the published/updated dates don't work in the same manner as they do now.
Replacing a chapter's entire content doesn't change the "updated" date. I've experimented with doing this myself (including just a few minutes ago), and the date stays the same.
As for review counts between ff.net and Teaspoon, Teaspoon is at a disadvantage because we don't allow anonymous reviews. I'm sure authors would get a lot more feedback if unsigned comments were allowed, but then a larger amount of abusive feedback could also result.
no subject
Date: 2013-03-01 06:32 pm (UTC)Actually for some of the older stories I do see some unsigned reviews still hanging around (they show up at "NA" in the author field). And for some of them also, the chapter is an "NA"--even though the story is one chapter, more reviews show up when you hit the "All" category instead of a particular chapter. Which makes me think that yes, there was a chapter that got deleted but the reviews for it are still there, somehow. See, eg: story id 217 from the list. There is only one chapter listed but if you go to the "All" tab a second review shows up.
That is good to know about edits vs. updates and published vs. last updated. Thanks!
no subject
Date: 2013-03-01 08:52 pm (UTC)no subject
Date: 2013-03-01 09:30 pm (UTC)would it be possible (and you may have already mentioned this) to split new and classic up? because i'd assume that we were (slightly) dragging down the averages on teaspoon - i.e. i think a lot more classic people do post to teaspoon rather than ff.net and classic fics have fewer fans so thus fewer comments), but we may be such a small percentage of the total stories anyway that it isn't relevant.
hmm, anyway - it's all interesting!
no subject
Date: 2013-03-01 10:00 pm (UTC)Speaking as someone who helped validate millions of those words: neither do I. I don't think my entire fiction output from now all the way back to when I first started writing short stories many years ago hits a million words, much less three million of them.
no subject
Date: 2013-03-01 10:19 pm (UTC)I have the eras listed for the stories. Some of them have more than one so I have to decide how to classify those ones, but I should be able to create "Classic" vs. "New Who" labels and look at reviews in each. Or "Classic", "New Who", "Both" and "Other" (for things like SJA and Torchwood).
no subject
Date: 2013-03-02 11:01 am (UTC)and the compellingly named dontgiveahoot... you leave hundreds of reviews but then don't bother with the final 200 chapters?
no subject
Date: 2013-03-03 12:43 am (UTC)I put up a graph for you in my latest post of the number of unique reviewers broken down by category (First Doctor, Second Doctor, etc...). It shows that at least on Teaspoon, the bulk of the Classics are reviewed in similar numbers as the New Who, but New Who has higher outliers. And the Nine era has the highest trends of unique reviewers per story, although I mention I haven't yet controlled for time.
no subject
Date: 2013-03-05 03:25 am (UTC)no subject
Date: 2013-03-05 03:32 am (UTC)no subject
Date: 2013-03-05 03:39 am (UTC)no subject
Date: 2013-03-05 04:13 am (UTC)