HTalk DSL for HBase is now open-source

As a company, we regularly use open-source components from the Hadoop ecosystem in our customer projects and software development. HBase, Spark and Zeppelin are the three major Apache projects we have incorporated into our HFactory offering, and it feels quite natural to now bring our own contribution to the community.

A humble one, obviously: using top-level Apache projects such as those mentioned above, we have always felt we were standing on the shoulders of giants. Still we ...

Read more →

HFactory integrates Spark for advanced analytics

Posted on 17 May 2016

After using it in various customer projects over the last year, we have now finalised the integration of Apache Spark into the HFactory platform. From the very start, Spark was a natural extension of the work we had accomplished within HFactory: the two solutions share the same functional programming principles and a focus on elegant, expressive APIs, and both are built on the powerful Scala programming language. So we are especially pleased to now offer a fully packaged solution ...

Read more →

HBase, one API to rule them all

Posted on 6 May 2015

In a recent podcast with O’Reilly, Cloudera’s Michael Stack underlined the recent contributions of Google’s engineering team to the HBase roadmap. At the time, he mostly stressed Google’s extensive experience and overall lead in wide-column datastore technology, and the considerable value it adds in making HBase an even better database.

Today’s announcement of a Cloud Bigtable offering on Google Cloud Platform puts his remark in a new light. As emphasized by Google, Cloud Bigtable is fully compatible with ...

Read more →

HBase: one database for both analytical and operational workloads

Posted on 15 Feb 2015

Given its deep integration within the Hadoop platform, the promise of HBase is clear: one datastore for both analytical and operational workloads. But these two types of workloads call for very different data interaction patterns.

On the one hand, exploratory analytics focuses on complex analysis. Data analysts need to run rich queries on the data, and typically generate analytic reports by combining standard SQL commands with BI visualization products. On the other hand, operational intelligence requires ...

Read more →

Three cool features of HBase

Posted on 29 Jan 2015

With the release of HBase v1.0 now imminent, we would like to pause and share our thoughts on some cool features of HBase. We will not talk here about HBase scalability, flexible schema design or deep integration with the Hadoop platform and ecosystem; this is all well known by now. Instead, we will focus on three additional characteristics of HBase that truly set it apart from other NoSQL databases:

  • Sorted row-keys
  • Control over data sharding
  • Strong consistency

Sorted row-keys

Manipulation ...

Read more →

A tribute to Facebook engineering

Posted on 30 Aug 2014

As a company heavily focused on HBase, we felt it appropriate to pay tribute to Facebook engineering in this blog. Facebook’s decision to use HBase as the backend for its Messages application back in 2010 was arguably a pivotal moment in the development of the column-oriented key-value store.

Back then, HBase was mostly used for storing web crawling data, and deployments were few and far between. Also, Facebook had internally developed its own key-value store, Cassandra, ...

Read more →