Entries tagged with “scale” from O'Reilly Radar

Thu

Oct 22
2009

Nat Torkington

Four short links: 22 October 2009

Cognitive Surplus, Scaling, Chinese Blogs, CS Education for Growth

by Nat Torkington@gnatcomments: 1

  1. Eight Billion Minutes Spent on Facebook Daily -- you weren't using that cognitive surplus, were you?
  2. How We Made Github Fast -- high-level summary is that the new "fast, good, cheap--pick any two" is "fast, new, easy--pick any two". (via Simon Willison)
  3. Isaac Mao, China, 40M Blogs and Counting -- Today, there are 40 million bloggers in China and around 200 million blogs, according to Mao. Some blogs survive only a few days before being shut down by authorities. More than 80% of people in China don’t know that the internet is censored in their country. When riots broke out in Xinjiang province this year, the authorities shut down internet access for the whole region. No one could get online.
  4. Congress Endorses CS Education as Driver of Economic Growth -- compare to Economist's Optimism that tech firms will help kick-start economic recovery is overdone.

tags: blogging, china, economy, education, facebook, infrastructure, scalecomments: 1
submit: Reddit Digg stumbleupon   

 

Fri

Jul 10
2009

Nat Torkington

Four short links: 10 July 2009

Network File System, Internet Use, Lovelace Comic, Search User Interfaces

by Nat Torkington@gnatcomments: 0

  1. Ceph -- open source distributed filesystem from UCSC. Ceph is built from the ground up to seamlessly and gracefully scale from gigabytes to petabytes and beyond. Scalability is considered in terms of workload as well as total storage. Ceph is designed to handle workloads in which tens thousands of clients or more simultaneously access the same file, or write to the same directory-usage scenarios that bring typical enterprise storage systems to their knees. (via joshua on delicious)
  2. Daily Internet Activities, 2000-2009 -- Pew Charitable Trust's Internet usage survey. We've finally broken 50% of Americans using the Internet daily. Twitter is almost a rounding error. (via dhowell on Twitter)
  3. The Thrilling Adventures of Lovelace and Babbage -- fantastic comic, with end-notes that explain how Babbage and Lovelace's lives and works are reflected in the action of the comic. (via suw on Twitter)
  4. Search User Interfaces -- full text of this book about the different (successful and un-) interfaces to search. (via sebchan on Twitter)

tags: history, scale, search, twitter, uicomments: 0
submit: Reddit Digg stumbleupon   

 

Fri

Jun 19
2009

Scott Ruthfield

Announcing: Spike Night at Velocity

by Scott Ruthfield@scottrucomments: 5

Guest blogger Scott Ruthfield is a Program Committee member of the O'Reilly Velocity: Web Performance & Operations Conference. 


Web Operations is not for the casual observer: it's for a particular kind of adrenaline junkie that's motivated by graphs and servers spinning out of control.  Jumping in, on-your-feet analysis, and experience-based-experimentation are all part of solving new problems caused by unexpected user and machine behavior, and keeping a clear head when service owners and executives are panicking is part of the job. 

A core part of operations leadership is spike management - what you do when you see a significantly larger amount of load than you've had before. Sometimes this is predictable months out (Amazon knows, for example, that the first or second Monday of December will be their biggest day each year), sometimes days out (Twitter knew Oprah was coming), and sometimes not at all (what we still call the Slashdot Effect). Every web ops professional deals with some kind of spike - even intranets manage paydays and employee review days - and if you're into it, well, spikes can be fun. Of course, maybe you use EC2 Auto-Scaling, and so (in theory) don't have to worry about it, although of course bottlenecks come in many forms.

So at Velocity this year, we're trying out something new: Spike Night.

Spike Night is a chance to see and learn about how real, high-traffic websites deal with massive increases in load, either expected or unexpected. We'll see real-world management of traffic increases - graphs, tools, the whole shebang.

Now, it turns out that when I called up lots of people on the phone and said "can we throw massive load at your website so you can stand on stage and brag about it," many web ops folks were excited, but then they start worrying about little things like "what if something goes wrong and everyone blogs about it" or "do I have to ask somebody in a PR department" and then calls went unreturned. 

Fortunately, two parties have stepped up, and I can't wait to see what they have to show:
  • Chris Bissell, Chief Software Architect at MySpace, and members of the MySpace team will demonstrate a massive, real increase in traffic, and will manage it on-stage. MySpace already deals with tens of thousands of hits each second - we can't throw enough traffic at them to cause any harm - so they'll cause their own harm and then show how they work through it.
  • Ryan NelsonOperations Director for MLB Advanced Media and MLB.com, will walk us through a combination of war stories and live traffic management to show what happens when millions of baseball fans all want to see what's happened after the commercial break at the exact same time. Between their very popular desktop apps and their newly-announced iPhone game streaming, the MLB is a true leader in technology innovation with a rabid fan base that goes well beyond the Web 2.0 echo chamber.
Spike Night is meant to be a fun event, taking place Tuesday June 23rd @ 7:30PM at Velocity, and open to the larger web community - a Velocity conference pass is not required to attend. I'm looking forward to hosting interesting demos and a fun Q&A, and hope to see all of you there!

tags: cloud, infrastructure, operations, performance, scalability, scale, spikenight, velocity, velocity09, velocityconf, web2.0, webopscomments: 5
submit: Reddit Digg stumbleupon   

 

Mon

Jun 8
2009

Nat Torkington

Four short links: 8 June 2009

3D Geometry, The Printable Web, Government Internet Fail, and Real World Cloud Computing

by Nat Torkington@gnatcomments: 2

  1. How to Project on 3D Geometry -- the fine art (and math) of distorting an image so that it looks undistorted when projected onto a non-flat 3D surface. Confused? See the images below. (via straup on Delicious)
  2. ZinePal -- Create your own printable magazine from any online content. (via warrenellis on Delicious)
  3. What The Government Doesn't Understand About The Internet And What To Do About It -- Tom Steinberg from MySociety lays it out. As true for US, NZ, and every other country as it is for the UK (for which it was written). Accept that any state institution that says “we control all the information about X” is going to look increasingly strange and frustrating to a public that’s used to be able to do whatever they want with information about themselves, or about anything they care about (both private and public). This means accepting that federated identity systems are coming and will probably be more successful than even official ID card systems: ditto citizen-held medical records. It means saying “We understand that letting train companies control who can interface with their ticketing systems means that the UK has awful train ticket websites that don’t work as hard as they should to help citizens buy cheaper tickets more easily. And we will change that, now.” What I like about Tom vs the US's Gov 2.0 is that Tom puts down philosophy that's hard to argue with, whereas the US is dangerously close to simply focusing on techniques and that's subvertible.
  4. Real World Cloud Computing -- summary from a panel of startups who are using EC2. The lock-in is latency. Transfering data within the Amazon services is free. Transfering data to an Amazon competitor: not free.
Sample distorted and undistorted images

tags: amazon, book related, cloud computing, ec2, gov 2.0, government, programming, scale, webcomments: 2
submit: Reddit Digg stumbleupon   

 

Fri

May 8
2009

Jesse Robbins

Velocity 2009 - Big Ideas (early registration deadline)

by Jesse Robbins@jesserobbinscomments: 7

what-is-velocityconf.png

(tag cloud created from Velocity session & speaker information using wordle.net)

My favorite interview question to ask candidates is: "What happens when you type www.(amazon|google|yahoo).com in your browser and press return?"

While the actual process of serving and rendering a page takes seconds to complete, describing it in real detail can take an hour. A good answer spans every part of the Internet from the client browser & operating system, DNS, through the network, to load balancers, servers, services, storage, down to the operating system & hardware, and all the way back again to the browser. It requires an understanding of TCP/IP, HTTP, & SSL deep enough to describe how connections are managed, how load-balancers work, and how certificates are exchanged and validated... and that's just the first request!

Web Performance & Operations is an emerging discipline which requires incredible breadth, focusing less on specific technologies and more on how the entire system works together. While people often specialize on particular components, great engineers always think of that component in relation to the whole. The best engineers are able to fly to the 50,000 foot view and see the entire system in motion and then zoom in to microscopic levels and examine the tiny movements of an individual part.

John Allspaw recently described this interconnectedness on his blog:

With websites, the introduction of change (for example, a bad database query) can affect (in a bad way) the entire system, not just the component(s) that saw the change. Adding handfuls of milliseconds to a query that’s made often, and you’re now holding page requests up longer. The same thing applies to optimizations as well. Break that [bad] query into two small fast ones, and watch how usage can change all over the system pretty quickly. Databases respond a bit faster, pages get built quicker, which means users click on more links, etc. This second-order effect of optimization is probably pretty familiar to those of us running sites of decent scale.

Working with these systems requires an understanding not only of the way technology interacts, but the way that people do as well. The structure, operation, and development of a website mirrors the organization that creates it, which is why so many people in WebOps focus on understanding and improving management culture & process.

Organizing a conference like Velocity is a wonderful challenge because it requires the same sort of thinking. We focus on the big concepts that everyone needs to know and then go deep into the technologies that change our understanding of the system. We find ways to share the unique experience that can only be gained by operating at scale. We make it safe to share as much of the "Secret Sauce" as we can.

Please join us at Velocity this year, we have an amazing lineup of speakers & participants. Early registration ends on Monday, May 11th at 11:59 PM Pacific. (Radar readers can use "vel09cmb" for an additional 15% discount.)

Velocity, the Web Performance and Operations Conference 2009

tags: cloud, data, infrastructure, operations, scale, velocity, velocity09, velocityconf, web, web2.0comments: 7
submit: Reddit Digg stumbleupon