This is a repository for my research, paper reading summaries/reviews, and relevant blog-like posts in markdown.
This is a schedule of the readings for CMPS 278, kept here for my convenience. I’ll also link my reading reviews as I write them, so that this feels a bit more complete.
Paper F1: A Distributed SQL Database That Scales (also discussing Spanner a little)
Review F1 - Reading Review
MapReduce: Simplified Data Processing on Large Clusters
Review MapReduce - Reading Review
Bigtable: A Distributed Storage System for Structured Data
Review BigTable - Reading Review
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing
Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases
Discretized Streams: Fault-Tolerant Streaming Computation at Scale
Predicting Multiple Metrics for Queries: Better Decisions Enabled by Machine Learning
The Case for Learned Index Structures
Cicada: Dependably Fast Multi-Core In-Memory Transactions
Eliminating Unscalable Communication in Transaction Processing
An Evaluation of Distributed Concurrency Control
Optimizing Space Amplification in RocksDB
GraphX: Graph Processing in a Distributed Dataflow Framework
Scalable Atomic Visibility with RAMP Transactions
Guest Speaker
-Data Curation at Scale: The Data Tamer System-
Data Provenance to Audit Compliance with Privacy Policy in the Internet of Things
The Myria Big Data Management and Analytics System and Cloud Service
NoDB: Efficient Query Execution on Raw Data Files
The Case for Heterogeneous HTAP