December 2011

Sun Mon Tue Wed Thu Fri Sat
        1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 22 23 24
25 26 27 28 29 30 31

Sponsors

« LAMP Stack Has Fewer Defects | Main | Gates to Retire »

June 14, 2006

Comments

Do you really think Wikipedia scales well? There are so many times during the day where access is so slow, not sure if that is due to the "design" (programming) of MediaWiki or what is the bottleneck.

till,

It scales very well but it's also typically been doubling in load every 3-4 months. Doing a few billion queries per day at the moment on the database side.

The bottleneck shifts around between the PHP boxes, the Squid cache servers, the database and various other parts of the system. It'll only be consistently fast when growth slows down. So far, making it fast has just encouraged faster growth until it slows down again. By way of comparison, other sites at its traffic load may typically have a thousand or two thousand servers instead of a hundred or two. Then there's Google and the hundreds of thousands... :)

Today, money is typically the best solution for Wikipedia slowness, with the vast majority of the possible tuning work already done on the software side. If growth stops at google.com levels, Wikipedia will end up needing about 10-20 million dollars more in equipment. Fortunately the donation rate has always increased with the traffic and so far the fund raising has been able to roughly keep up with the need for hardware.

And the software... not only zero charge but MySQL donated a support contract.

I was the first Wikipedia DBA and now both I and the person who took over from me are working for MySQL Support, so if you wondered what those support contracts buy, it includes people who've done the one server to billions of queries and top 20 web sites growth and tuning experience.

James Day

The comments to this entry are closed.

Copyright

  • Copyright (c) 2005-2010 M. Zack Urlocker

Google


Guitarcenter