In-memory processing has started to become the norm in large-scale data handling. This is aclose to the metal analysis of highly important but often neglected aspects of memory access times and how it impacts big data and NoSQL technologies.
We cover aspects such as the TLB, the Transparent Huge Pages, the QPI Link, Hyperthreading and the impact of virtualization on high-memory footprint applications. We present benchmarks of various technologies ranging from Cloudera’s Impala to Couchbase and how they are impacted by the underlying hardware.
The key takeaway for the presentation bellow is a better understanding of how to size a cluster, how to choose a cloud provider and an instance type for big data and NoSQL workloads and why not every core or GB of RAM is created equal.
For several decades, SQL (Structured Query Language) has been the most used programming language for managing databases. SQL is a relational database management system (RDBMS). Recently, NoSQL has arisen as an option to SQL, and for many this is exciting because it frees the database manager from the restrictive tables and columns associated with SQL, which are not well-suited to managing unstructured data sets associated with big data. What, exactly, are the differences between the two, and when is NoSQL a preferable model to SQL? Continue Reading
Next week we will be presenting in a very interesting meetup, where we will talk about Couchbase and Hadoop: how to quickly move data from one to the other and how to get sub-milliseconds response time with Couchbase.
The meetup will take place on the 17th of September and it will feature speakers from Avira as well.
Avira is a worldwide leading supplier of security solutions for professional and private use. They have been using Couchbase and CDH in production for two years, working on customer behavior analysis and exploring how machine learning can improve their clients’ experience.
Many IT directors are failing to get the optimum performance from their infrastructure. So we’ve been conducting a number of benchmarking studies to see to how this can be improved.
Our product manager Alex Bordei presented the findings on an O’Reilly webcast which was hosted by O’Reilly’s Chief Data Scientist and Director of Content Strategy for Data, Ben Lorica, a true big data luminary.