Cassandra Cluster Synchronization


Cassandra is a highly distributable NoSQL database with tunable consistency. What makes it highly distributable also makes it, in part, vulnerable: the whole deployment must run on synchronized clocks. Given how crucial this is, it is quite surprising that it is not covered sufficiently in the literature. And, if it is, it simply refers…
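
To illustrate why clock synchronization matters, here is a minimal sketch of last-write-wins conflict resolution, the general approach Cassandra uses to reconcile competing writes to the same cell by comparing write timestamps. The names `Write`, `stamp`, and `resolve` are hypothetical and are not part of any Cassandra API; tie-breaking and tombstones are deliberately left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Write:
    value: str
    timestamp_us: int  # timestamp in microseconds, taken from the writer's clock


def stamp(value: str, true_time_us: int, clock_offset_us: int) -> Write:
    """Hypothetical helper: stamp a write using a clock that is off by clock_offset_us."""
    return Write(value, true_time_us + clock_offset_us)


def resolve(a: Write, b: Write) -> Write:
    """Last-write-wins: keep the write with the higher timestamp.

    Simplification: on equal timestamps this sketch just returns b.
    """
    return a if a.timestamp_us > b.timestamp_us else b


# Synchronized clocks: the write that really happened later wins.
in_sync = resolve(stamp("old", 1_000_000, 0), stamp("new", 1_200_000, 0))

# Skewed clocks: the first writer's clock runs 500 ms fast, so its earlier
# write carries the higher timestamp and silently shadows the later one.
skewed = resolve(stamp("old", 1_000_000, 500_000), stamp("new", 1_200_000, 0))

print(in_sync.value, skewed.value)
```

With synchronized clocks the later write is kept; with the assumed 500 ms skew the older value survives, and nothing signals that an update was lost.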

Data tiedowns with reliable time stamping


Management teams are growing more reliant on the ability to immediately access and quickly sort through massive amounts of data to find the information they need - Data Governance for Financial Institution. A "data tiedown" is a reliable and cross-checked timestamp that secures a data item or group of data items…

Obama’s big data team


Those applications used a huge chunk of Amazon's Eastern data center. “We had thousands of nodes,” said VanDenPlas. “We pushed 180TB of traffic with billions and billions of requests. We had 60% of all of Amazon's medium [instances] in US East.” Read more: http://sdt.bz/37299#ixzz2I0RN4r00

Big Data and politics


For the general public, there was no way to know that the idea for the Parker contest had come from a data-mining discovery about some supporters: affection for contests, small dinners and celebrity. But from the beginning, campaign manager Jim Messina had promised a totally different, metric-driven kind of campaign…