Architecting a Low-Latency Schemaless SQL Engine | Rockset
Get the slides: https://www.datacouncil.ai/talks/architecting-low-latency-schema-less-sql-engine
ABOUT THE TALK
In this talk we'll walk through the journey of building Rockset, a data system for low-latency search and analytics queries. Rockset enables SQL queries on schemaless datasets. What are some of the challenges of adapting SQL to the principles of strong dynamic typing? We will also discuss the idea of full converged indexing: building both a columnar store and inverted indexes on all the fields in the entire dataset. Indexes make queries faster, but they also increase storage costs and write latencies. When are the tradeoffs worth it? Finally, we'll talk about the challenges and opportunities of designing data systems in the cloud-native world. Can we build a stateful system that autoscales based on load?
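To make the converged-indexing idea concrete, here is a minimal Python sketch. It is not Rockset's implementation: the function names (`build_converged_index`, `lookup`, `column_sum`), the in-memory dictionaries, and the sample documents are all assumptions made for illustration, and it only handles flat documents with scalar values. Each field of each document is written twice, once into a per-field column and once into an inverted index keyed by field, type, and value, so the same field can hold values of different types without a declared schema.

```python
from collections import defaultdict


def build_converged_index(docs):
    """Index every field of every document two ways (illustrative only)."""
    columnar = defaultdict(dict)  # field -> {doc_id: value}
    inverted = defaultdict(set)   # (field, type name, value) -> {doc_id, ...}
    for doc_id, doc in enumerate(docs):
        for field, value in doc.items():
            # Columnar entry: good for scans and aggregations over one field.
            columnar[field][doc_id] = value
            # Inverted entry: good for selective lookups. The type name is
            # part of the key, so 31 and "31" never match each other.
            inverted[(field, type(value).__name__, value)].add(doc_id)
    return columnar, inverted


# Hypothetical schemaless documents: "age" is an int in one document,
# a string in another, and absent from the third.
docs = [
    {"name": "ann", "age": 31},
    {"name": "bob", "age": "31"},
    {"name": "cy", "city": "Zagreb"},
]
columnar, inverted = build_converged_index(docs)


def lookup(field, value):
    """Return ids of documents where field equals value, with the same type."""
    key = (field, type(value).__name__, value)
    return sorted(inverted.get(key, set()))


def column_sum(field):
    """Sum the integer values stored in one column, skipping other types."""
    return sum(
        v for v in columnar[field].values()
        if isinstance(v, int) and not isinstance(v, bool)
    )
```

In this sketch, `lookup("age", 31)` consults only the inverted index, while `column_sum("age")` scans only the `age` column. Every value is stored twice, which is the storage and write-latency cost the abstract refers to.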
ABOUT THE SPEAKER
Igor Canadi is a software engineer at Rockset, where he is developing its data indexing and distributed SQL query engine. Previously, Igor was an engineer at Facebook, working on the database engineering and product infrastructure teams, where he contributed to RocksDB, developed MongoRocks (MongoDB with the RocksDB storage engine), drove RocksDB open source initiatives, worked on core GraphQL infrastructure for Facebook’s Android application, and owned GraphQL developer tooling for hundreds of developers. Igor holds a master’s degree in computer science from the University of Wisconsin-Madison and a bachelor’s degree from the University of Zagreb. In his free time, he likes sailing and snowboarding.
ABOUT DATA COUNCIL:
Data Council (https://www.datacouncil.ai/) is a community and conference series that provides data professionals with the learning and networking opportunities they need to grow their careers. Make sure to subscribe to our channel for more videos, including DC_THURS, our series of live online interviews with leading data professionals from top open source projects and startups.
FOLLOW DATA COUNCIL:
Twitter: https://twitter.com/DataCouncilAI
LinkedIn: https://www.linkedin.com/company/datacouncil-ai
Facebook: https://www.facebook.com/datacouncilai
Eventbrite: https://www.eventbrite.com/o/data-council-30357384520