PinnedVu TrinhinData Engineer ThingsI spent 5 hours understanding more about the Delta Lake table formatAll insights from the paper: Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores17 min read·May 4, 2024--2--2
PinnedVu TrinhinData Engineer ThingsHow does Uber build real-time infrastructure to handle petabytes of data every day?All insights from the paper: Real-time data infrastructure at Uber19 min read·Mar 23, 2024--14--14
Vu TrinhinThe Deep HubAll you need to know about the Google File SystemHow did Google build its large-scale file system?16 min read·May 12, 2024--5--5
Vu TrinhGroupBy #33: Data Gateway — A Platform for Growing and Protecting the Data Tier at Netflix, The…Plus: Solving RevenueCat’s data ingestion challenges into Snowflake, From ZooKeeper to KRaft: How the Kafka migration works6 min read·May 3, 2024----
Vu TrinhGroupBy #32: Canva — Scaling to Count Billions, Ensuring Precision and Integrity: A Deep Dive into…Plus: LLM fine-tuning and evaluation in BigQuery, How We Built Slack AI To Be Secure and Private7 min read·Apr 28, 2024----
Vu TrinhinTowards Data ScienceThe Stream Processing Model Behind Google Cloud DataflowBalancing correctness, latency, and cost in unbounded data processing14 min read·Apr 27, 2024----
Vu TrinhinData Engineer ThingsDo We Need the Lakehouse Architecture?When data lakes and data warehouses are not enough.10 min read·Apr 20, 2024--11--11
Vu TrinhGroupBy #31: Migrating a Trillion Entries of Uber’s Ledger Data from DynamoDB to LedgerStore, Grab…Plus: Airbnb open sourced Chronon — ML Feature Platform, BigQuery data canvas6 min read·Apr 17, 2024----
Vu TrinhinData Engineer ThingsA Closer Look Into Databricks’s Photon EnginePart 2 of Databricks’s Photon paper note: Vectorization12 min read·Apr 13, 2024----
Vu TrinhGroupBy #30: Uber- How LedgerStore Supports Trillions of Indexes, Composable Data Systems: Lessons…Plus: Spotify — Data Platform Explained, Grab — Turning observations into actionable insights for enhanced decision-making.5 min read·Apr 10, 2024----