Webpulp.tv

A video podcast that explores
technology that powers the web
Jacob Harris New York Times

Using Varnish to scale NYtimes.com

I had a chat with Jacob Harris to talk about New York Times, working on the interactive newsroom team, and how they use varnish to scale their apps without breaking a sweat.

Show Notes

  • The interactive newsroom team makes interactive apps that go up on the NYTimes home page: elections, oscars, olymipics, etc.
  • News editing staff with done thing such as election, water polluters, football player, travel, face a lot of charms and challenges but its fun.
  • They are using Rails Apps for 3 years running with EC2 tools couple of proxy come in and ice of varnish of cache in the traffic.com
  • They use MySQLfor databases and it's awesome according to Jacob
  • The Mango IT is very useful for photo uploads forms, data that could anything, pictures, and colors. Also used in Twitter hacker but mostly in New York Times
  • In easy tunes, they have few backend server applications a loader and bouncer and 1 varnish cache. Miscellaneous machine, Mysql- RDS mostly use.
  • They looking for using Red-Hot Proxy Season and Proxy Spot
  • If there's a events like election and Oscar award night they use proxy, bc2, 3 or 4 ec 2, 4 additional web server, for traffic and a large of ec2 proxy different from day to day.
  • New York times have a bunch of 40 live application, mango, proxy configuration, for market folks sever. They use varnish for cache, it’s a good deal to the traffic.
  • Varnish is easy to set up it has VCL language, it’s a DSL, and main feature of Varnish is its configuration language. When the program comes in it takes action and if your retrieving on the backend it easy to store cache.
  • Varnish is technically single point of failure. But it’s a way of correcting. Central point cache goes to application and it’s very quick, do not need to go further.
  • Varnish is very powerful to decomposing the site in a small sub page in the substituting together in cache with separate time outs, different TTPL, can send full dynamic in the cache of the site.
  • Olympics apps, New York Times have a graphic group; they are people good in JavaScript and do a lot of work with Flash Apps,and maps wizard for IPod.
  • They have 1 varnish to all of the traffic.
  • New York Times Request per day, 30 to 60 hits for seconds for traffic.
  • Varnish can used for open source C 10K caches.
  • Varnish cache control is private stuff.
  • Varnish VSL, browser can be cache things and sometimes its use by JavaScript or a flash thing by adding step on request.
  • In general, varnish follows the same logic a browser cache, it’s a browser header. It cache the full URL, can check longer, with particular info like PPIT, CSC, Google, CC Sql, which cached keys.
  • Jacob said that Varnish is his favorite cached tools right now.