How to Process Slowly Changing Dimensions in Hive
How to handle Slowly Changing Dimensions (SCD) in a data warehouse built on Hive.
Recommender Systems Comparison: The Best Performing Algorithm
How to use packages for building recommender systems in R: recommenderlab, recosystem, SlopeOne and SVDApproximation.
Big Data Warehousing with Elasticsearch
Here's how to address Big Data warehousing with Elasticsearch.
Product Development Services 2.0: Startup Success for Venture Capitalists
Why Product Development Services 2.0 may be the panacea venture capitalists need to ensure their startups' success.
Cassandra + Spark SQL Performance (including DSE 5.0)
This post focuses on a quick-and-dirty performance comparison of different Cassandra + Spark options.
Bitcoin Analytics: The Principles of Network Development, Part 1
Bitcoin: a brief history of the digital currency network, with its most influential nodes, patterns, and changes over time.
8 Issues Moving MS SQL Cluster from BareMetal to AWS
Here are the eight most common challenges that can crop up during an MS SQL migration from BareMetal to AWS.
Setup Cassandra + Spark + Tableau (including DSE 5.0)
Here's how to configure Cassandra + Spark + Tableau, including DataStax Enterprise (DSE) 5.0.
Skype Automation: Bulk Renaming of Contacts with PowerShell
Automation hocus-pocus: forget about renaming Skype contacts manually and do it with PowerShell and a small additional library.
Running Cassandra with Cloudera Manager
Here are the benefits of Cassandra and CDH integration, which can be deployed and managed through Cloudera Manager.