Useful Links related to BigData tools
- Hadoop Weekly - the best resources about Hadoop and related tools
- Kafka - the Definitive Guide
- DataBricks: Spark Tungsten Internals
- DataBricks: Spark Streaming Execution Model
- Haifeng Li: Introduction to Big Data – free book “geared toward software architects and advanced developers”
- Haifeng Li: Blog on Big Data
- Waiting For Code - Blog on Spark, Beam, and similar tools
- Spark for Data Science: the Good, Bad and Ugly
- Astronomer’s Apache Airflow resources
- Scylla DB - ScyllaDB vs Amazon DynamoDB
Hive UDFs
OpenStack
- OpenStack Summit Berlin 2018 - has a short but good overview of OpenStack
- OpenStack version 14 released - goals and new features for OpenStack
Other
- Steve Loughran: Hadoop and Kerberos
- thisdataguy: Blog – interesting blog on bigdata topics
- Data Engineers vs Data Scientists - different roles in a Big Data project
- Jaeger - distributed tracing system
- Magalix: Kubernetes-101
- John Oliver: Artificial Intelligence