How to Process Slowly Changing Dimensions in Hive
How to handle Slowly Changing Dimensions (SCD) in a data warehouse with Hive database.
5 min readRecommender Systems Comparison: The Best Performing Algorithm
How to use packages for building recommender systems in R: recommenderlab, recosystem, SlopeOne and SVDApproximation.
2 min readBig Data Warehousing with Elasticsearch
Here's how to address Big Data warehousing with Elasticsea rch.
2 min readCassandra + Spark SQL Performance (including DSE 5.0)
This post focuses on quick and dirty performance comparison of different Cassandra + Spark options.
21 min readBitcoin Analytics: The Principles of Network Development, Part 1
Bitcoin: brief history of the digital currency network, with the most influential nodes, patterns, and changes over time.
6 min readSetup Cassandra + Spark + Tableau (including DSE 5.0)
Here's how to configure Cassandra + Spark + Tableau including DataStax Enterprise (DSE) 5.0
22 min readRunning Cassandra with Cloudera Manager
Here're the benefits of Cassandra and CDH integration that can be deployed and managed through Cloudera Manager.
3 min readInstalling Hadoop Cluster with Cloudera Manager
How fast and easy it may be to install Hadoop cluster with Cloudera Manager.
9 min readEnabling TLS Level 1 Encryption for Cloudera Manager
How to configure TLS Level 1 encryption for Cloudera Manager with self-signed certificate.
4 min readAnomaly Detection – Unsupervised Approach
From data mining to intrusion detection: how anomaly helps to identify informational security risk.
1 min read