There’s an unspoken truth lurking behind the scourge of Big Data and the heralding of Hadoop as its savior: While Hadoop shines as a processing platform, it is awkward as a query tool. Hive was developed by the folks at Facebook in 2008, as a means of providing an easy-to-use, SQL-like query language that would … Continue reading “beyond hadoop: fast queries from big data”
This week’s Oracle World was bracketed by two events. First: the unveiling of Oracle Exalytics, a beefy in-memory appliance dedicated to large-scale analytics, during Larry Ellison’s opening keynote. Second: the undressing of Oracle’s cloud computing initiatives by Marc Benioff, Salesforce’s CEO, and the unceremonious cancellation of his keynote on Wednesday morning. Both events highlight that … Continue reading “how Oracle, the Goliath of data, could stumble”
A couple of weeks ago, I was drinking beer in San Francisco with friends when someone quipped: “You have too many hipsters, you won’t scale like that. Hire some fat guys who know C++.” It’s funny, but it got me thinking. Who are the “fat guys who know C++”, or as someone else put it, “the … Continue reading “the secret guild of silicon valley”
The Big Data Stack, from my piece, Building Data Startups at O’Reilly Radar.
Slides from my presentation at O’Reilly’s Strata Online Conference on May 25, 2010.
This past February, I moderated an event at Stanford about predictive analytics. I led with a brief introduction, followed by a discussion with Omar Tawakol, CEO of BlueKai; Scott Burke of Yahoo!; Matt Barkoff, VP at Badgeville; and Theresia Gouw Ranzetta, Partner at Accel Partners. See the YouTube video here.
The future of the web is data, not documents. The web has evolved from Tim Berners-Lee’s original vision of “some big, virtual documentation system in the sky” into a vibrant ecosystem of data where documents, and human actors, will play an ever smaller role. As others have noted, we’ve reached a tipping point in history: … Continue reading “the rise of the data web”
At O’Reilly’s “Making Data Work” seminar earlier this summer, I teamed up with a few other folks (data diva Hilary Mason, R extraordinaire Joe Adler, and visualization guru Ben Fry) to talk about data. What follows is a blog-ified and amended version of that talk, originally entitled “Secrets of Successful Data Scientists.” 1. Choose The … Continue reading “the seven secrets of successful data scientists”