Perhaps if we lived on a crest, things would be different. - LiveJournal is shrinking. In new faux-academic style! [entries|archive|friends|userinfo]
Pronounced "Pie Ropp"

LiveJournal is shrinking. In new faux-academic style! [Aug. 5th, 2007|09:06 pm]
Previous Entry Add to Memories Tell a Friend Next Entry
[Tags|]
[Music |DJ Shadow - The Number Song]

(If you're on my friends list, you've seen most of this already.)

LiveJournal is shrinking, at least in terms of active accounts. The number of active accounts reached its peak in April of 2005, and has been decreasing ever since then. Let's take a look at the graphs.


Figure 1.

Figure 2.

Figure 3.

Figure 4.

It may save you some scrolling back and forth to open the gallery of these images in another tab or window or something.

Methodology.

The total accounts data comes from stats.txt, the rest was scraped from archive.org's archive of stats.bml.

(This section becomes increasingly boring after this point.)

Archive.org's archiving of stats.bml is sort of sporadic: years 2000 through 2003 are sampled 4 or 5 times a year, but then 2004 and 2005 are samples more than 100 times during the year, sometimes more than once per day. The data obtained from archive.org was cleaned to remove repeated data (from when stats.bml hadn't updated between snapshots) and other bogus data.

Lines between data points are meant only to be a guide for the eye, not represent what the actual data there should be. There is one fake data point for everything but total accounts inserted at the day when the invite-only period ended (December 12, 2003). This was done because the data points near this day are fairly far apart, and a linear approximation overestimates the increase before that date. This point was determined by assuming all other quantities grow before that day at the same rate that total accounts grows. Again, this point should only be taken as a guide for the eye. Data points aren't shown because they'd run together and it wouldn't actually be helpful. There's a link to the data set at the end of this post, for anyone who wants to see the data points themselves.

The "Active accounts" number was introduced on February 26, 2003 (specifically, in revision 1.25 of stats.pl in the LJ source code.) Obviously, there is no data for "Active accounts" before then. The way stats.pl finds that number was changed in revisions 1.46 and 1.47 on December 14, 2005. The numbers it now comes up with are much smaller than those it came with before. The data from before revision 1.46 is referred to as "old calculation" and after is referred to as "new calculation." It is possible that revision 1.29 changed the number that stats.pl comes up with, but i can't tell from looking at it.

Meaning of quantities.

  • Total accounts is the total number of accounts that have ever been created on LiveJournal. This include deleted accounts. Equivalently, this is the user number of the most recently created user.
  • Accounts that have ever been updated is pretty much what it sounds like. I'm not sure if this includes accounts that updated before they were deleted. (Maybe someone more familiar with the LJ codebase can answer that.)
  • Active accounts, last 30 days is the number of accounts that have had anything happen to them in the database in the past 30 days, such as updating or uploading a userpic or changing their profile etc. The way this was calculated was changed on December 14, 2005 as described in the previous section. The numbers it comes up with now are much lower.
  • Updated last... is another straightforward one. It should be noted that short-term variations in "updated last day" don't really mean much, since people tend to post more on certain days of the week. This is somewhat true for "updated last 7 days" too. (There tends to be a slight downturn in "updated last 7 days" at the end of every year near Christmas and New Years, presumably due to people being busy with holiday-related things then.)

I have included the dates of three significant dates for LJ. The invite-only period had an obvious effect on LJ's population growth. The inclusion of the date of 6A's purchase of LJ is included for curiosity's sake, and shouldn't necessarily be taken as suggesting or not suggesting that 6A's purchase of LJ determines the population growth/non-growth of the following period.

I'm going to implicitly make the assumption from here on that proportional changes to any of the activity measures correlate to proportional changes in number of users. I don't think this is unreasonable, but I also don't think it's necessarily an extremely strong correlation.

Discussion

LiveJournal experienced a near-continuous trend of growth in the number of active accounts, neglecting small bumps, until April of 2005. All measures of account activity peaked at various points within the month of April. There has been a consistent decreasing trend since April of 2005.

NumberDateValue at peak
Active accounts last 30 days (by old calculation)April 22 20052662552
Accounts updated in the last 30 daysApril 14 20051534960
Accounts updated in the last 7 daysApril 5 2005 985495
Accounts updated in the last dayApril 5, 2005 380810

Table 1. Days of the peaks of activities and their values there. Not that, due to our not having a continuous data set for these numbers, the actual peaks might have been a little higher and at a nearby date.

The current numbers, at the time that I'm writing this, of accounts updating in the last 30 days is 985313. This mean that there has been a 36% decrease in the number of accounts updating in the last 30 days since the peak, or a decrease of 11% per year. Unfortunately, comparing the active accounts number is sort of apples-to-oranges because of the change made to how they calculate the active accounts.

This decrease raises the question; how can the total number of accounts on LJ be growing while the number of active accounts shrinks? What is probably going on is a balance of users, e.g. new users replacing those who leave. Let's examine the rate of account creation. We get a very clear picture of this from stats.txt.


Figure 5. Account creation versus time. The marks indicating the beginning and end of the invite-only period and LJ's purchase are in the same places as before.

There are some interesting peaks in this graph. Once the invite-only period ends there's a huge spike in the account creation rate. (14842 new users the day LJ opened up, and a whopping spike of 17630 new users/day on December 14, 2003! yow!) There is another peak on July 19, 2004 of 13609 new users/day, and one on January 16th, 2005, right after the 6A purchase, of 14175 new users/day, preceding a long downwards trend. There's another spike at 12088 on February 22, 2006, and then the new user rate seems to stay fairly constant.

Interestingly, quite a few of these spikes have corresponding features on the activity graphs Figure 3 and Figure 4. Both "active accounts" and "updated last 30 days" have small bumps a month or so after the spikes in user creation. The post-6A downwards trend is reflected in the activity graph, though delayed by a few months. It is worth remembering that decreases in the number of users updating will not show up in "updated last 30 days" or "active accounts" since they, well, consider every account that's updated or has activity in the last 30 days, but even considering that these features still seem to lag behind some. There are other features that probably correlate. (There's a big drop in November of 2006 corresponding to the power outage then.)

Conclusions.

First and foremost; It is completely wrong to say that LJ has 13 million users. LJ only has a larger population than the state of Illinois, as claimed here, if you're running things like an old-style Chicago election and are counting the deceased and people who don't live there anymore. LJ's actual population, according to active accounts, has probably never been much higher than 2.6 million, and is currently near 1.7 million, which is still a decent number. It's more than the population of Phoenix, Arizona or Montreal.

We've (I've been trying to avoid the academic we this entire post) fully established that LJ is shrinking in terms of account activity. This suggests LJ is shrinking in general, but there are other aspects of LJ that ought to be analyzed before we can say, full-out, that LJ is shrinking. The two main aspects I'm thinking of are posts per day and number of paid or permanent accounts. For the former, it is entirely possible that today's breed of LJers, though less numerous, post more. For the second, it could be argued that the ones who would pay for LJ are the really important users (certainly from 6A's perspective). These are two quantities that used to be on stats.bml, but posts per day was removed in 2003 when it became too big of a burden on the servers, and the amount of users paying for LJ was removed near the end of 2005 for unstated reasons. There are all sorts of interesting data one could get with unlimited access to the database, which is something people who aren't employed by 6A don't have, and getting these numbers by a random method would probably be interpreted as a DoS attack on the servers.

The users who are leaving are either giving up on LJ and blogs entirely or going to other services. Or, rarely, dying or falling into comas. That's a tautology; Call it the Law of Conservation of Users. I remember there being an entry in Anil Dash's blog at some point which had a clustermap of the "blogosphere" (gag) where LJ was off on it's own little glob, so LJ's fairly insular. I was wondering if they might have went off to other sites using the LJ software, so I counted, and GreatestJournal, DeadJournal, InsaneJournal, and JournalFen combined have 59735 accounts that have updated in the past 30 days. That's 3.9% of LJ's peak, and 11% of all of the decrease from the peak to now. Even if every one of those services had a userbase consisting entirely of people who left LJ after April of 2004, that doesn't account for everyone who has left. Other blogging sites aren't nearly as straightforward with their statistics; Vox has none at all, and I can't find any on Blogger or whatnot either. In general, this is the kind of data that people pay market analysts to come up with, and would probably require a full analysis of the field of blogging/social networking software and websites. While I seem to have enough free time to write this mess, I don't have enough free time to do 6A's market research for it.

Though some people would like to jump on the chance to blame 6A, it is premature to say that 6A has been causing the decrease in number of active accounts. The data I have is suggestive, but when it comes down to it, the suggestion is weak. Off the top of my head, I can think of a bunch of other reasons, such as a net-wide decrease in the popularity of casual blogging, or previously active LJ users growing up and getting jobs or going to grad school or doing other things that prevent them from LJing.

I'm afraid that I'm unable to answer the question, "Why is LJ shrinking?" with the data I have available to me. This is, really, one of those questions that will never have a full answer. A more meaningful question to ask might be "How do we make LJ stop shrinking?" which is a question that 6A ought to answer and implement. I think that, while LJ is still of a healthy size, having a declining user base is not good for it.

My next project is going to be to wring whatever other information i can out of the archive.org stats.bml. Gender distribution versus time, age distribution versus time. The data here seemed like the most important part, though, so i did that first.

Acknowledgments.

Thanks to everyone on my friends list who commented on this when it was in its formative stages. Thanks to Brad Fitzpatrick for making the LJ source code public so that I could go root around in it for answers. Thanks to archive.org for doing what they do and letting me use their snapshots. Last but not least, thanks to the team that runs LJ for making the user statistics public.

The graphics in this post were made in the software Plot for Mac OS X. The data for this is available in an excel file here, for anyone who wants to fool around with it.

The post i made in the mathematics community relating to this data has some interesting comments.

LinkReply

Comments:
Page 1 of 2
<<[1] [2] >>
[User Picture]From: [info]ellie
2007-08-06 03:13 am (UTC)

(Link)

Of course I can only speak from personal experience, but I think there are a few reasons for the decline.

First and foremost, most tweens are going to myspace. My cousins and their friends are in highschool/middle school, and they're myspacers. They can post pictures of their parties and their prom and graduation and everyone can see and spew "OMG's." College kids are off on facebook. Last week's time magazine did a pretty interesting essay by Steven Levy on the economic structures of facebook and myspace.

Secondly, I think that there have been some pretty public blunders about LJ recently. I'm thinking the banning of breastfeeding icons and the recent purge of people with "child rape" as an interest are the two big ones.

My own personal friends list is shrinking in active users too. Most of my friends are in their mid twenties. They've got jobs, Masters programs, families... you know, a life outside the internet. I post daily because I use it as my personal journal. It's just easier to keep track of on a database than a bunch of notebooks. But a good many of my friends don't see it that way. Why keep track of the goings of your friends online when you can call them up? Oddly enough, most of my friends page are people I know IRL.

These are just my own person thoughts... I have no idea how you could prove them valid or invalid.
[User Picture]From: [info]ronaldraygun
2007-08-06 03:35 am (UTC)

(Link)

Migration to new services such as Twitter, MySpace, Facebook and Vox probably accounts for a fair amount of loss of usership. There have always been competitors to Livejournal such as Xanga and Deadjournal as well as more "professional" blogging services such as Blogger, Wordpress and Typepad. That being said, though, it'd be hard to attribute recent decline in activity to migration to other blogging services. I think it's just the fact that internet kids want to be at the cutting edge of online social interaction and are finding new services that are not necessarily better but different.

If you look at the 6A purchase date and activity, it looks like they got a raw deal.

It'd be interesting to put launch dates of Twitter, MySpace and Facebook on the same graph and see if that could have an impact. Of course, you'd probably need usage statistics from MySpace et al to see if there's actually a relationship.
[User Picture]From: [info]remix79
2007-08-06 03:43 am (UTC)

(Link)

Wow! Thanks for all the hard work that went into this post. You should definitely bring this post to 6A's attention (if you haven't already).
[User Picture]From: [info]miang
2007-08-06 03:53 am (UTC)

(Link)

Very interesting analysis, although (not that you can help it) a bit premature to see the effects of Strikethrough and its aftershocks, I think. As far as general usage trends go, I think you're absolutely right that the real story is going to be in the interactions (gender by year, age cohort by year, etc.), and I wish you luck in getting that data.

As for Strikethrough, it's going to take some time for the full impact to be known. Besides the people who've already jumped ship, there are the people who need three to five strikes to decide to go, the people who need all their friends to go before they're willing to move, and the people (possibly the largest category, actually) who haven't gone but are simply refusing to continue paying for the service -- some of whom will eventually go when they decide they hate the free service after being used to paid perks for so long. I'm in the latter group, myself. And all these people will continue to be active until they can get a communication strategy worked out between themselves and their entire friends lists, comprised themselves of people in various stages of staying or leaving. :)
[User Picture]From: [info]joxn
2007-08-06 06:21 am (UTC)

(Link)

Can you do that first graph with a log y-axis?
Re: Ikwn - (Anonymous) Expand
[User Picture]From: [info]myth
2007-08-06 07:31 am (UTC)

soon it'll just be all of us with user IDs less than 5k

(Link)

This is exceptional - informative and interesting. Keep it up ... I'm definitely interested in reading the gender/age distribution statistics.
[User Picture]From: [info]torgo_x
2007-08-06 09:19 am (UTC)

Raw Encouragement

(Link)

To quote the Talking Heads, "and when things fell apart / nobody paid much attention". Except, yes, you're paying very close and careful attention, and I very much appreciate that.
[User Picture]From: [info]acwmaiden
2007-08-06 02:04 pm (UTC)

(Link)

Good luck on the age and gender stuff :-)

My crazy brain actually enjoyed reading through all that and scrolling back and forth to see what you were referencing in all your graphs :-D
[User Picture]From: [info]gretzilla
2007-08-06 08:18 pm (UTC)

(Link)

I agree wholly and completely with [info]myth. This is exceptional. And I am totally in love with your nerditude. Thank you for your well thought-out data crunching and analysis.
[User Picture]From: [info]lavendertook
2007-08-07 03:02 am (UTC)

(Link)

Thanks for all tis info! Do you have any idea if LJ/6a uses active Lj stats for attracting advertisers or if it affects their ad revenue? I have no idea where to go for this info.

I'm wondering if how many entries active users are making during a time period is a more important stat than other activity, like commenting, uploading pics,icons,managing formats, etc. If a big enough block of people stopped making entries for a while, would it affect LJ, or would those users have to delete their accounts to have any impact?

[User Picture]From: [info]slashpine
2007-08-07 03:53 am (UTC)

(Link)

Wonderful presentation! I am blown away by the work you've done -- which few of us could carry out, but all will be interested in! If there are fandom Service Awards, your nomination is assured.

It could be my eyes at this point in the day, but it seems to me that Table 1 has a typo on the year?

The lead-in sentence: There has been a consistent decreasing trend since April of 2005.

But the table rows (horribly garbled by my lack of html fu) has all 2004 dates:

Number Date Value at peak
Active accounts last 30 days (by old calculation) ____ April 22 2004 __ 2662552
Accounts updated in the last 30 days ____________ April 14 2004 __ 1534960
Accounts updated in the last 7 days_____________ April 5 2004 __ 985495
Accounts updated in the last day _______________ April 5, 2004 __ 380810

Table 1. Days of the peaks of activities and their values there. Not that, due to our not having a continuous data set for these numbers, the actual peaks might have been a little higher and at a nearby date.


Going back to reading now - I was just getting to the juicy parts! Not that sheer data doesn't have (to quote Joe Stalin) a "quantity that is a quality all its own." So again, my boggle-eyed and wild applause for your work compiling and interpreting this!!
[User Picture]From: [info]modillian
2007-08-07 04:12 am (UTC)

(Link)

LJ only has a larger population than the state of Illinois, as claimed here, if you're running things like an old-style Chicago election and are counting the deceased and people who don't live there anymore.
I love you. *pumps Chicagonian fist* Be proud of your continuous and notoruous corruption, baby! :P Um whoops, maybe that's not exactly the compliment LJ has been looking for...
[User Picture]From: [info]slashpine
2007-08-07 04:47 am (UTC)

(Link)

Again, lovely work! Thank you so much. Some random thoughts --

Finding out where the "drop-outs" go is always hard, since by definition you see them only by their absence. One possible intercept of one part of that group (if you cared!) would be via polling in LJ of continuing users to ask where friends who've left have gone, as well as the "just disappeared" category. Many universities do this when students just vanish -- the results are always indicative.

One result my university found in investigating "non-returning" sophomores and juniors was the number who turned out to have formed *no* personal connections to speak of while they were at our school; those who failed to form strong connections were more likely to drop out. That community=continuity factor probably operates in LJ as well... certainly in fandom.

Couldn't the entire "rise and fall" scenario just be an early-adopter/ passing fad kind of thing? I should think that's huge for anything high-tech.

Some have noted that for many, LJ or other SNS's are "just another form of email." Or less portable video cellphones. I may be reaching, but it seems to me there might be some fall-off when gmail and other more high-powered programs kicked in. A lot of my academic colleagues soured early on websites due to the work, and tend not to blog due to the exposure; but their arguments against SNS are both of those plus the same as our students: why go to the work when your cell and email handle so much of this?

Anyway - all very thought-provoking, and I look forward to seeing your further developments of the analysis!
[User Picture]From: [info]samtaburetkin
2007-08-07 06:32 am (UTC)

(Link)

There are some segments of LJ that have life of its own. I came to this platform following a crowd of Russians (livejournal.com was recognized as the website that had the greatest impact on real life a couple of years ago in one of the most established Russian surveys of cyberspace in 2004 or 2005) -- and they are no teens, I would tend to think the median age would be closer to 30. If you have easy access to the usage data, and can isolate the journals where posts are in Russian only, you would probably see a rather different dynamics, although the spike at "going public" would be even more pronounced -- no sane Russian will pay a Western programmer like [info]avva with the real credit card money :))
[User Picture]From: [info]insomnia
2007-08-07 11:57 am (UTC)

(Link)

I've been tracking this for a long time now, even before this post, but it's nice to see some graphs on it.

You may want to consider using some of my data as well, as it might make your graphs more accurate.
[User Picture]From: [info]cupidsbow
2007-08-07 02:47 pm (UTC)

(Link)

Fascinating. Thank you.
[User Picture]From: [info]tehsunny
2007-08-07 08:01 pm (UTC)

(Link)

This is fascinating. I'm a Sociology major at the University of Pennsylvania and the functioning of this kind of virtual community is one of the things I'm interested in studying closely... The data here is awesome, both in how it's presented and in terms of the further study it suggests.

Thanks very much for pulling it all together. Great work. :D

Oh, and I'd like to friend you if that's okay, just to see what you do with this next.
[User Picture]From: [info]gomezticator
2007-08-07 08:54 pm (UTC)

(Link)

LJ rarely deleting abandoned accounts probably has something to do with the continued rise in the face of declining usage.
[User Picture]From: [info]aristaea
2007-08-08 01:45 am (UTC)

(Link)

I was linked here from . . . somewhere I can't recall just now, but this is really interesting. I'm a psych person, so I love statistics detailing social behaviors, and I hope you don't mind if I add this to memories.
[User Picture]From: [info]msilverstar
2007-08-08 02:06 am (UTC)

(Link)

I've been thinking about it, and I'm pretty sure that people are going to MySpace and Facebook and even the community parts of Flickr and YouTube.
[User Picture]From: [info]nomadmatan
2007-08-08 06:55 am (UTC)

(Link)

Hey, now you're internet famous :o)

When are you visiting us?
[User Picture]From: [info]clauclauclaudia
2007-08-08 09:13 am (UTC)

(Link)

Thank you for this.

One broken link: http://www.livejournal.com/stats.txt should be stats/stats.txt (IIRC, but it's definitely wrong as is).
[User Picture]From: [info]mynn
2007-08-08 01:40 pm (UTC)

Fascinating, and thank you.

(Link)

Question - any way to account for spam? For a while, I "collected" spammers and reported them to LJ from time to time, only a couple hundred on bored days.

http://www.livejournal.com/stats/latest.bml
[User Picture]From: [info]stardance
2007-08-08 03:51 pm (UTC)

(Link)

This was really interesting. Thank you for posting this.
Page 1 of 2
<<[1] [2] >>