Using Apache Kafka and Apache Pinot for User-Facing, Real-Time Analytics
00:00:00 Setup, Overview of Data Science DC
00:11:45 Intro of Speakers and Review of Use Cases
00:17:00 Intro to Apache Kafka, ksqlDB, and where it fits in real-time analytics system
00:29:56 Intro to Apache Pinot
00:32:00 Pinot Architecture
00:35:28 Demo - Pinot & Kafka
00:45:00 Recap of live demo
00:45:40 Apache Pinot Indexing (including Star-tree indexing)
00:48:55 Partitioning & Tiered Storage
00:53:15 Takeaways
00:55:12 Q&A
Apache Kafka is the de facto standard for real-time event streaming, but what do you do if you want to perform user-facing, ad-hoc, real-time analytics too? That's a hard problem. Apache Pinot solves it, and the two together are like chocolate and peanut butter, peaches and cream, and Steve Rogers and Peggy Carter. Come to this talk for an introduction to both systems and a view of how they work together.
About Apache Pinot:
Pinot is a real-time distributed OLAP datastore, built to deliver scalable real-time analytics with low latency. It can ingest from batch data sources (such as Hadoop HDFS, Amazon S3, Azure ADLS, Google Cloud Storage) as well as stream data sources (such as Apache Kafka).
Pinot was built by engineers at LinkedIn and Uber and is designed to scale up and out with no upper bound. Performance always remains constant based on the size of your cluster and an expected query per second (QPS) threshold.
--------------------------
SPEAKERS:
--------------------------
Tim Berglund is a teacher, author, and technology leader with Confluent, where he serves as the Senior Director of Developer Advocacy. He can frequently be found speaking at conferences in the United States and all over the world. He is the co-presenter of various training videos on topics ranging from Git to Distributed Systems to Apache Kafka. He tweets as @tlberglund, blogs very occasionally at http://timberglund.com, and lives in Littleton, CO, USA.
Neha Pawar is a Founding Engineer at a Stealth Mode Startup. Prior to this, she worked at LinkedIn as a Senior Software Engineer in the Data Analytics Infrastructure org. Neha is an Apache Pinot PMC and Committer & has made numerous impactful contributions to the Apache Pinot project. She actively fosters the growing Apache Pinot community & loves to evangelize Apache Pinot in the form of blogs, video tutorials, speaking in meetups and conferences. You can find her on Twitter at @nehapawar18.
Brought to you by Confluent, Pinot, and Data Science DC, part of Data Community DC.
Видео Using Apache Kafka and Apache Pinot for User-Facing, Real-Time Analytics канала StarTree
00:11:45 Intro of Speakers and Review of Use Cases
00:17:00 Intro to Apache Kafka, ksqlDB, and where it fits in real-time analytics system
00:29:56 Intro to Apache Pinot
00:32:00 Pinot Architecture
00:35:28 Demo - Pinot & Kafka
00:45:00 Recap of live demo
00:45:40 Apache Pinot Indexing (including Star-tree indexing)
00:48:55 Partitioning & Tiered Storage
00:53:15 Takeaways
00:55:12 Q&A
Apache Kafka is the de facto standard for real-time event streaming, but what do you do if you want to perform user-facing, ad-hoc, real-time analytics too? That's a hard problem. Apache Pinot solves it, and the two together are like chocolate and peanut butter, peaches and cream, and Steve Rogers and Peggy Carter. Come to this talk for an introduction to both systems and a view of how they work together.
About Apache Pinot:
Pinot is a real-time distributed OLAP datastore, built to deliver scalable real-time analytics with low latency. It can ingest from batch data sources (such as Hadoop HDFS, Amazon S3, Azure ADLS, Google Cloud Storage) as well as stream data sources (such as Apache Kafka).
Pinot was built by engineers at LinkedIn and Uber and is designed to scale up and out with no upper bound. Performance always remains constant based on the size of your cluster and an expected query per second (QPS) threshold.
--------------------------
SPEAKERS:
--------------------------
Tim Berglund is a teacher, author, and technology leader with Confluent, where he serves as the Senior Director of Developer Advocacy. He can frequently be found speaking at conferences in the United States and all over the world. He is the co-presenter of various training videos on topics ranging from Git to Distributed Systems to Apache Kafka. He tweets as @tlberglund, blogs very occasionally at http://timberglund.com, and lives in Littleton, CO, USA.
Neha Pawar is a Founding Engineer at a Stealth Mode Startup. Prior to this, she worked at LinkedIn as a Senior Software Engineer in the Data Analytics Infrastructure org. Neha is an Apache Pinot PMC and Committer & has made numerous impactful contributions to the Apache Pinot project. She actively fosters the growing Apache Pinot community & loves to evangelize Apache Pinot in the form of blogs, video tutorials, speaking in meetups and conferences. You can find her on Twitter at @nehapawar18.
Brought to you by Confluent, Pinot, and Data Science DC, part of Data Community DC.
Видео Using Apache Kafka and Apache Pinot for User-Facing, Real-Time Analytics канала StarTree
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Building Real-Time Analytics Applications Using Apache PinotIntro to Apache PinotRun Apache Spark on Kubernetes with Amazon EMR on Amazon EKS - AWS Online Tech TalksUber Marketplace Meetup: Using Distributed Locking to Build Reliable SystemsIoT Architectures + Use Cases for Apache Kafka - Consumer and Industrial IoT (IIoT / Industry 4.0)Apache Pinot 2021 Recap and 2022 Roadmap Community Discussion2. Motivations and Customer Use Cases | Apache Kafka® FundamentalsBBC Online: Architecting for Scale with the Cloud and ServerlessIntroduction to Apache Pinot | Pinot for Beginners | What is Apache Pinot?Getting Started with Apache PinotExtract Data From Rest Api In Talend 👉 How To Connect Rest Api In TalendNetflix - Presto & Iceberg for AnalyticsHow to use Flow to automate your Power BI Reports[Uber Open Source] Leveraging Pinot at Uber for Large-scale AnalyticsInteractive Realtime Dashboards On Data Streams Using Apache Kafka ,Druid And SupersetOptimizing for Speed and Scale of Real-Time Analytics Using Apache PinotA Deep Dive into Apache Kafka This is Event Streaming by Andrew Dunnings & Katherine StanleyDeveloping Real-Time Data Pipelines with Apache KafkaReal-time Tables in Apache PinotHow Uber scaled its Real Time Infrastructure to Trillion events per day