Achieving Operational Excellence for data engineering - Shantanu Shirvastarva, Zeotap
Zeotap saw a 10x growth with the number of data pipelines and amount of data processed growing at rapid pace in a short span of time. The increasing scale challenged our capability to track production data pipelines across multiple products and infrastructure deployments. To tackle this problem, we came up with Kingpin - a Centralized Job Management Framework. Kingpin addresses some of the common issues faced with data pipeline operations: Workflow Dependency management, Multi-Framework execution support Fault Tolerance, and Alerting mechanisms, Scheduled and On-Demand executions Workflow tracking across multiple deployments. Additionally, we have integrated Kingpin with Data Visualization Tools for a Unified Job Level View with Metric Reporting capabilities to increase system visibility and reduce the operational time, efforts, and costs.
Видео Achieving Operational Excellence for data engineering - Shantanu Shirvastarva, Zeotap канала Qubole: The Cost-Efficient Data Lake
Видео Achieving Operational Excellence for data engineering - Shantanu Shirvastarva, Zeotap канала Qubole: The Cost-Efficient Data Lake
Показать
Комментарии отсутствуют
Информация о видео
25 ноября 2020 г. 21:24:27
00:26:14
Другие видео канала
Talk #2: Adobe Experience Platform Insights on Achieving High Scale Using Apache AirflowData Leadership: Innovation and Value Creation - Kirk Borne, Booz Allen HamiltonOracle - Justin Wainwright - Data Platforms 2017A Hub and Spoke Approach to Scaling Storage - Mark Senerth & Mohan Naidu, The Walt Disney CompanyWorkbench: Create, Execute and Save a CommandKarthik Panel - Data Platforms 2017[Ai4 Webinar] Modernizing ML & AI Operations to Advance Healthcare'Data Governance in Multi-Tenant Data Lakes - A Tech Perspective' - Sathish K S, ZeotapThe Open Data Lake Talks Optimizing Costs in A Changing WorldBuilding a Real-Time Decision Engine Using ML on Apache Spark Structured StreamingWebinar: Unlock AI Use Cases - Ignite Spark with Jupyter NotebooksPresto Summit India 2019 - "Towards GDPR CCPA compliance with Hive ACID"What is Qubole? As Told By QubolersRunning Apache Spark jobs cheaper while maximizing performance - Brad Caffey, Expedia GroupNexla - Data Platforms 2017Building and Scaling a Data and Analytics Ecosystem - Prabhu Prakesh Ganesh, CTO, MiQAnalytics on Analytics: Leveraging Metadata in the Big Data Landscape - Kent Buboltz, Expedia GroupData Lakes Fundamentals and Best Practices - Lessons learned in Planning, Strategy, and ExecutionQubole: AWS Graviton Processor SupportDave Wilby (Return Path) showcases innovative AI and Machine Learning platform powered by QuboleDecentralized Data Platform at Bukalapak - Hafiz Badrie Lubis, Bulkalapak