Entries tagged with “research” from O'Reilly Radar

Fri

Nov 20
2009

Ben Lorica

Asia Continues to be Facebook's Strongest Growth Region

by Ben Lorica@dlimancomments: 0

With Facebook topping 330 million active users over the past week, the company's strongest growth region continues to be Asia. Over the last 12 weeks, Facebook added close to 17M active users in Asia alone. Since my previous post, the share of active users from Asia grew by 2% (to 13.5% of all users), and roughly 1 in 7 users now come from the region. With a market penetration under 2%, Facebook is poised to add many more users in Asia (and Africa).

pathint

Compared to the U.S., the proportion of Facebook users in their teens (13-17) or in the 18-25 age group are much higher in Asia:

pathint

As was the case in other parts of the world, expect the share of users 45 and older to climb as Facebook becomes more mainstream in Asia. Growth was strong across all age groups in Asia over the last 12 weeks, particularly among teens (+90%) and the 18-25 age group (+60%).

pathint
In other regions, notably North America, Europe, the Middle East, and South America, growth in the 18-25 age bracket, lagged behind users 45 and older.

In closing I want to highlight countries (within several regions) where Facebook has been growing rapidly:

(continue reading)

tags: facebook, hard numbers, platforms, research, social networkingcomments: 0
submit: Reddit Digg stumbleupon   

 

Thu

Nov 12
2009

Nat Torkington

Four short links: 12 November 2009

CRM on Rails, Data Mining on Hadoop, Disappointing Keynotes, The Teapot Effect

by Nat Torkington@gnatcomments: 1

  1. Fat Free CRM -- open source (Affero GPL) Ruby on Rails CRM system.
  2. Bixo -- open source data mining toolkit that runs as a series of pipes on top of Hadoop. Built on Cascading workflow system for Hadoop that hides MapReduce. (via kdnuggets)
  3. Andy Kessler's Keynote at Defrag Stank (Pete Warden) -- I'm sorry to hear it, because I loved Andy's book How We Got Here about the intersecting histories of economics, finance, and technology. Read the book instead of reading about the disappointing keynote.
  4. The Teapot Effect -- the thing I love about geeks is how their passion causes them to explore, ruthlessly and quantitatively, the everyday phenomena that the rest of us take for granted. Such as dribbling teapots: “Previous studies have shown that dribbling is the result of flow separation where the layer of fluid closest to the boundary becomes detached from it. When that happens, the fluid flows smoothly over the lip. But as the flow rate decreases, the boundary layer re-attaches to the surface causing dribbling.” Read the post and the research it talks about to learn how to prevent Dribbling Teapot Syndrome ....

tags: CRM, data mining, economics, finance, hadoop, history, open source, rails, research, sciencecomments: 1
submit: Reddit Digg stumbleupon   

 

Mon

Nov 2
2009

Nat Torkington

Four short links: 2 November 2009

Inside Botnets, Creating Choropleths, Privacy Simplified, Massively Machiavellian Online Social Gaming

by Nat Torkington@gnatcomments: 1

  1. Your Botnet is My Botnet (PDF) -- 2008 USENIX Security paper analysing >70G of data gathered when security researchers hijacked the Torpig botnet. A major limitation of analyzing a botnet from the inside is the limited view. Most current botnets use stripped-down IRC or HTTP servers as their command and control channels, and it is not possible to make reliable statements about other bots. In particular, it is difficult to determine the size of the botnet or the amount and nature of the sensitive data that is stolen. One way to overcome this limitation is to “hijack” the entire botnet, typically by seizing control of the C&C channel. [...] As a result, whenever a bot resolves a domain (or URL) to connect to its C&C server, the connection is redirected or sinkholed. This provides the defender with a complete view of all IPs that attempt to connect to the C&C server as well as interesting information that the bots might send..
  2. cartographer.js -- build thematic maps using Google Maps. To be precise, you can build a choropleth, which is my word of the day. (via Simon Willison)
  3. Making Privacy Policies Not Suck (Aza Raskin) -- interested in developing a standard set of privacy policy components the way that Creative Commons has created a standard set of copyright license components.
  4. Scamville: The Social Gaming Ecosystem of Hell (TechCrunch) -- many of those games on Facebook that your friends play are evil. To get in-game money or objects, they'll let you take a survey but at the end you're signed up for crap you never wanted. Related: this article on monetizing social networks which talks about social gaming's business model.

tags: creative commons, gaming, google maps, mapping, privacy, research, security, socialcomments: 1
submit: Reddit Digg stumbleupon   

 

Fri

Oct 30
2009

Nat Torkington

Four short links: 30 October 2009

Three Minute Theses, Google Wave RPGs, Public Metadata, and The Finitely-Zoomable Natural World

by Nat Torkington@gnatcomments: 0

  1. The3is In Three -- PhD students must explain their thesis topic in three minutes and one Powerpoint slide. Winner had written on the last words of Shakespearean characters as they met unlikely ends. No video alas, but what a great idea for an Ignite! (via sciblogs)
  2. Google Wave: We Came, We Saw, We Played D&D (ArsTechnica) -- gamers using Wave to play RPGs. This can't be the killer app, however, because it is not pornographic. (via BoingBoing)
  3. Metadata is Public Record (ArsTechnica) -- Arizone State Supreme Court rules that metadata on the public record is itself in the public record. The test case was a cop who suspected his performance reports had been created when he asked for them and then backdated. His employer had argued the inode info wasn't part of the public record, even though his report was. Sanity prevailed. (via glynmoody on Twitter)
  4. Cell Size and Scale -- sweet zoomable interface to show the different relationships in size between everything from Times Regular 12pt to a Carbon atom (via salt, E. coli, hemoglobin, etc.). (via Tom Carden on Delicious)

tags: education, events, google wave, metadata, open data, research, science, uicomments: 0
submit: Reddit Digg stumbleupon   

 

Fri

Sep 25
2009

Nat Torkington

Four short links: 25 September 2009

On Wheel Reinvention, Research Visualization, New Comments, and Defective Congressional Data

by Nat Torkington@gnatcomments: 4

  1. Diesel: A Case Study In That Thing I Just Said -- a new asynchronous I/O library in Python, which earned this fabulous review from Glyph Lefkowitz who wrote the granddaddy of all asynch libraries in Python, Twisted. Again, I don't want to dump on Diesel here; for what it is, i.e. an experiment in how to idiomatically structure asynchronous applications, it's all right. For that matter Twisted has its fair share of bugs too, which would be pretty easy to lay out in a similar post; you wouldn't even need to do the research yourself, just go look at our bug tracker. But both Diesel and Tornado make the mistake of attempting to replace the years of trial-and-error, years of testing discipline, and years of portability and feature work that Twisted has accumulated with a few oversimplified, untested hacks.
  2. Eigenfactor -- ranking and mapping scientific knowledge. Visualizations and analyses from when geeks attack scientific publishing.
  3. Washington Post Develops Visual, Web-like Commenting System -- WebCom displays comments in a dynamic web instead of a traditional list. As new comments come in, the web gets bigger. The web, however, is not organized by chronology. King and his team believe that the most valuable comments are those that are rated highly by peers and those that spur responses. WebCom uses those criteria to organize the web. (via The Evolving Newsroom)
  4. Congressional Data is Defective By Design -- You should have better access to this info! You should have — at your fingertips — immediate, unrestricted digital access to the full text of any piece of legislation the very moment it’s released publicly by Congress. This is punishingly ridiculous. Congress could immediately take steps to make all publicly-relevant legislative data comply with the community-derived Eight Principles of Open Government Data.[...] That is to say, bill info from Congress could and should be available today in real time, free of charge, open-source, and licensed openly, via such open-standards technologies as XML, API’s, and regular bulk data downloads. We're entering a time where the tools and methods that make good software can help make good laws. (via timoreilly on Twitter)

tags: gov2.0, programming, python, research, social software, transparency, visualizationcomments: 4
submit: Reddit Digg stumbleupon   

 

Thu

Sep 24
2009

Ben Lorica

There are Over a Million People Actively Using Facebook Right Now

by Ben Lorica@dlimancomments: 7

A little over a week ago Facebook reached a major milestone: 300 million active users. The fastest-growth region continues to be Asia, but growth in other overseas regions such as the Americas and Africa have also been strong. Currently reaching only 1% of potential users in Asia and Africa, Facebook has barely scratched the surface in both regions:

pathint

Growth in the U.S. remains fastest among those age 45 and older, and the share of those users is higher in the U.S. than overseas. In other regions recent growth tended to be more evenly divided among age groups. One notable exception has been the teen group in Asia, which grew over 80% in the last 12 weeks.

pathint

Of the 300 million users, how many are actively using Facebook right now? (For the rest of this post active means not just logged in, but actually engaged.) By treating the previous question as a Fermi problem, I can probably derive a decent estimate. First, I assume that the average fraction of people actively using Facebook at any moment, equals the fraction of time an average Facebook user is active on the site††. Without access to any usage stats, I'll throw out the following guesstimate: a typical Facebook user spends 4 hours per month (or 48 per year) actively using the site.

pathint

Depending on how accurate you want to be, there are 1.6 to 6 million people actively using Facebook right now. If the average Facebook user spends considerably more than 4 hours per month (actively) using the site, the estimate would be much higher than a 1.6 million. I do have an escape clause: in classic Fermi problems, being within a factor of 10 is considered acceptable.

(†) Increasingly popular in the business world, Fermi problems have long been staples in Physics (and Math) departments.
(††) In other words, if the average Facebook user spends 1% of her time actively using the site, on average 1% of all Facebook users are actively using the site at any given moment.

tags: facebook, fermi problem, hard numbers, platforms, research, social networkingcomments: 7
submit: Reddit Digg stumbleupon   

 

Wed

Aug 26
2009

Nat Torkington

Four short links: 26 August 2009

Food, NoSQL, Brain Power, Social Data

by Nat Torkington@gnatcomments: 0

  1. Better BBQ Through Chemistry -- food is the perfect ground for geek training: there are measurements, there's science, it's easy to know whether you've succeeded, and you can eat all but the worst of your failures. (via BoingBoing)
  2. NoSQL (East) -- conference on East Coast for relationless databases.
  3. Human Brain Processing Speed -- clocked at 60bits/second, according to this MIT Technology Review article. Their approach eventually led to Hick's Law, one of the few laws of experimental psychology. It states that the time it takes to make a choice is linearly related to the entropy of the possible alternatives. The results from various reaction-time experiments seem to show that this is the case. Although one byproduct of this approach is that the results are intimately linked to the type of experiment used to measure the reaction time. And that makes each study peculiarly vulnerable to the idiosyncrasies of the experimental approach. Today, Fermi Moscoso del Prado Martín from the Université de Provence in France proposes a new way to study reaction times by analyzing the entropy of their distribution, rather in the manner of thermodynamics. (via Hacker News)
  4. Truly Social Data -- Data will only be truly social when you can work with it in the kinds of ways we work with information in the real, non-computational, world. In the real world we don’t ask for permission to have an opinion on something, to add to the ball of information surrounding a concept. Our needs don’t have to be anticipated by programmers. We can share information as we please. For example, nobody owns the concept of Barcelona. If I want to essentially “tag” Barcelona as being hot, or noisy, or beautiful, I just do it. I can keep my opinion private, I can share it with certain others, I can hold conflicting opinions, I can organize things in multiple ways at the same time and give things many names.

tags: brain, data, fluiddb, food, nosql, research, science, socialcomments: 0
submit: Reddit Digg stumbleupon   

 

Tue

Aug 11
2009

Nat Torkington

Four short links: 11 August 2009

by Nat Torkington@gnatcomments: 0

  1. The Slowing Growth of Wikipedia and More Details of Changing Editor Resistance -- researchers at PARC analysed Wikipedia and found the number of new articles and number of new editors have flattened off, and more edits from first-time contributors are being reverted. This is a writeup in their blog, with the numbers and charts. It's interesting that coverage in New Scientist talked about "quality", but none of the metrics PARC studied are actually quality. Wikipedia launched a strategic review which aims to tackle this and many other issues. (via ACM TechNews)
  2. The Information Architecture of Social Experience Design: Five Principles, Five Anti-Patterns and 96 Patterns (in Three Buckets) -- teaser for upcoming O'Reilly book with some really good stuff. Balzac once wrote, “The secret of great wealth with no obvious source is some forgotten crime, forgotten because it was done neatly,” and many successful social sites today founded themselves on an original sin, perhaps a spammy viral invitation model or unapproved abuse of new users' address books. Some companies never lived down the taint and other seems to have passed some unspoken statute of limitations. (via BoingBoing)
  3. Skulpt -- entirely in-browser implementation of Python. (via Andy Baio)
  4. Why Can't Local Government and Open Source Be Friends? -- the Birmingham example is one of many. Government procurement and tendering processes are often fishing expeditions, which biases responses in favour of commercial software companies making mad margins such that they can respond to RFPs that are really RFIs, etc. It's an issue everywhere in the world because it happens at local, not just central, level.

tags: book related, government, open source, python, research, social software, web, wikipediacomments: 0
submit: Reddit Digg stumbleupon   

 

Fri

Jun 19
2009

Ben Lorica

Facebook Adds Million of Users in Asia

by Ben Lorica@dlimancomments: 1

Since my previous post on Facebook users by country, the company has grown rapidly in Asia. Over the last 12 weeks, Facebook grew 90% in Asia going from 11.4 to 21.7 million active users. With a Market Penetration of only 0.6% in Asia, Facebook has barely scratched the surface in the region.

pathint

The company also gained 11.3M users in Europe (up 19%) and 14.7M users in North America (up 21%) over the last 12 weeks. On a year-over-year basis, Facebook grew 194% (adding close to 150 million active users worldwide) from Jun/2008 to Jun/2009.

For more details, you can view regional numbers below:

tags: facebook, hard numbers, platforms, research, social networkingcomments: 1
submit: Reddit Digg stumbleupon   

 

Mon

Apr 27
2009

Nat Torkington

Four short links: 27 Apr 2009

Data centers, open research, Jeopardy!, and tombstones

by Nat Torkington@gnatcomments: 5

  1. Google Server and Data Center Details -- Greg Linden reports on a Efficient Data Center Summit. Google uses single volt power and on-board uninterruptible power supply to raise efficiency at the motherboard from the norm of 65-85% to 99.99%. There is a picture of the board on slide 17. (and this is a 2005 board). Greg has left Microsoft as Live Labs is dissolved.
  2. The Economics of Open Access Publishing -- set of papers on the free distribution of research. Pointed to by the RePEc blog. RePEc is Research Papers in Economics, a collaborative effort of hundreds of volunteers in 67 countries to enhance the dissemination of research in economics. The heart of the project is a decentralized database of working papers, journal articles and software components. All RePEc material is freely available. (via Paul Reynolds)
  3. Computer Program to Take On Jeopardy! (NY Times) -- move over Turing Test, IBM's working on the Trebek Test: a computer program to compete against human “Jeopardy!” contestants. If the program beats the humans, the field of artificial intelligence will have made a leap forward. Really? The system must be able to deal with analogies, puns, double entendres and relationships like size and location, all at lightning speed. Oh, ok. So it's more complex than inverting the hash table of questions and answers. (via ericries on Twitter)
  4. The Value of Minimal Data (Powerhouse Museum) -- if you have the ability for passionate users to contribute their knowledge, they can turn "minimal" data into a delicious four course data feast with a vintage port to sip during the dessert course. (via sebchan on Twitter)

tags: collective intelligence, energy, open access, open data, power management, researchcomments: 5
submit: Reddit Digg stumbleupon   

 

Sun

Apr 19
2009

Ben Lorica

Active Facebook Users By Country

by Ben Lorica@dlimancomments: 24

Since I last posted numbers on Facebook's user base six week ago, the company has added close to 20 million active users.

pathint

I've had a few requests for detailed numbers by country so I quickly assembled an update for each of the regions shown above.

(continue reading)

tags: facebook, hard numbers, platforms, research, social networkingcomments: 24
submit: Reddit Digg stumbleupon