AWS News Blog
New Elastic MapReduce Goodies: Apache Hive, Karmasphere Studio for Hadoop, Cloudera’s Hadoop Distribution
Earlier today, Amazon’s Peter Sirota took the stage at Hadoop World and announced a number of new goodies for Amazon Elastic MapReduce. Here’s what he revealed to the crowd: Apache Hive Support Elastic MapReduce now suports Apache Hive. Hive builds on Hadoop to provide tools for data summarization, ad hoc querying, and analysis of large […]
Read MoreAWS Multi-Factor Authentication In Action!
I love to speak to user groups because there’s little time pressure and plenty of time to engage with the audience. As part of last week’s very pleasant trip to Iowa, I spoke to the CIALUG (Central Iowa Linux User’s Group).We met in a very cool coworking space called the Impromptu Studio.The audience wasn’t shy […]
Read MoreLearn about Amazon Web Services in China, India and Singapore
My evangelism trips to Asia (India in 2007, Round-trip around the world in 2008) has always been action-packed. I love these trips especially because I learn a lot, make ton of new friends while spreading the word about something I am really passionate about. Hence, this time, I plan to visit China, Singapore and India. […]
Read MoreAnkoder – Video Encoding on Demand
Rex Chung wrote to tell me that his EC2-powered Ankoder site can now segment files for iPhone HTTP streaming. As noted in the iPhone Dev Center, HTTP streaming obviates the need for specialized servers, extra firewall entries, and othe complexity. It supports live and video on demand sessions. The Ankoder blog contains a complete walkthrough […]
Read MoreLower Prices for EC2 Windows Instances using Authentication Services
We’ve removed the distinction between Amazon EC2 running Windows and Amazon EC2 running Windows with Authentication Services, allowing all of our Windows instances to make use of Authentication Services such as LDAP, RADIUS, and Kerberos. With this change, any Windows instance can host a Domain Controller or join an existing domain. File sharing services such […]
Read MoreWebinar: Securing Public Cloud Infrastructures
Mark time in your calendars for a cloud security webinar co-presented by Amazon Web Services and enStratus on Wednesday October 7, 2009 at 11:30 AM – 12:15 PM Central Time US. Sign up today Public cloud computing has evolved into a mainstream approach for building out components of an IT infrastructure. Cost saving opportunities make […]
Read MoreBioinformatics, Genomes, EC2, and Hadoop
I think it is really interesting to see how breakthroughs and process improvements in one scientific or technical discipline can drive that discipline forward while also enabling progress in other seemingly unrelated disciplines. The Bioinformatics field is rife with examples of this pattern. Declining hardware costs, , the ability to do parallel processing, and algorithmic […]
Read MoreNew Public Data Set: Wikipedia XML Data
Weighing in at a whopping 500 GB (388 GB of data and 112 GB of free space to allow for some in-place decompression), the Wikipedia XML data is our newest Public Data Set. This data set contains all of the Wikimedia wikis in the form of wikitext source and metadata embedded in XML. We’ll be […]
Read MoreNew Public Data Set: Daily Global Weather
The folks at Infochimps have just released the Daily Global Weather Public Data Set. This 20 GB data set incorporates daily weather measurements (temperature, dew point, wind speed, humidity, barometric pressure, and so forth) from over 9000 weather stations around the world. The data was originally collected as part of the Global Surface Summary of […]
Read MoreNew Public Data Set: Sloan Digital Sky Survey DR6 Subset
The Sloan Digital Sky Survey, or SDSS, is now available as a Public Data Set. Weighing in at 180 GB, the SDSS is the most ambitious astronomical survey ever undertaken. The researchers have used a 2.5 meter, 120 megapixel telescope located in Apache Point, New Mexico to capture images of over one quarter of the […]
Read More