Multi-tenanted streams @Workday - Enrico Agnoli & Leire Fernandez
At WORKDAY Inc., #1 Future Fortune company 2018 (link: https://fortune.com/future-50/2018/workday/), we process data for our community of more than 39 million workers, including 40 percent of Fortune 500 organizations. Our success is driven by the trust our customer puts on us and we give them confidence with our strict security regulations. This demands that we always encrypt customer data at rest and in transit: each piece of data should always be stored, encrypted with the customer key.
This is a challenge in a Data Streaming platform like Flink, where data may be persisted in multiple phases:
Storage of States in Checkpoints or Savepoints, Temporary fs storage for time-window aggregation, Common spilling to disk when heap is full.
On top of that, we need to consider that in a Flink dataflow data might get manipulated and we need to maintain the context needed to correctly encrypt it.
Come join us to see how we solved this challenges to provide a secure platform to support our MachineLearning organization, how we extended AVRO libraries to enable encryption at serialization and how we support data traceability for GDPR.
Видео Multi-tenanted streams @Workday - Enrico Agnoli & Leire Fernandez канала Flink Forward
This is a challenge in a Data Streaming platform like Flink, where data may be persisted in multiple phases:
Storage of States in Checkpoints or Savepoints, Temporary fs storage for time-window aggregation, Common spilling to disk when heap is full.
On top of that, we need to consider that in a Flink dataflow data might get manipulated and we need to maintain the context needed to correctly encrypt it.
Come join us to see how we solved this challenges to provide a secure platform to support our MachineLearning organization, how we extended AVRO libraries to enable encryption at serialization and how we support data traceability for GDPR.
Видео Multi-tenanted streams @Workday - Enrico Agnoli & Leire Fernandez канала Flink Forward
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Build a Table-centric Apache Flink Ecosystem - Shaoxuan WangFinding Bad Acorns - Andrew Gao & Jeff Sharpe#FlinkForward SF 2017: Ufuk Celebi - The Stream Processor as a DatabaseImproving throughput and latency with Flink's network stack - Nico KruberStreaming for Enterprises - Srikanth SatyaBuilding Unified Streaming Platform at UberAnalytics for the masses - Aslam TajwalaWriting an interactive streaming SQL engine and pre-parser using Flink - Kenny GormanInterview with Gyula Fóra, Data Warehouse Engineer at KingAdventures in Scaling from Zero to 5 Billion Data Points per Day - Dave TorokSplunk Data Stream ProcessorOne SQL to Rule Them All - Fabian HueskeBuilding an open-source ML feature store with Apache FlinkData Pipeline Lifecycle: SQL EverywhereCEP platform handling millions of users - lessons from 3 years in productionWhat turns stream processing from a tool into a platform? - Stephan EwenScotty: Efficient Window Aggregation with General Stream Slicing - Jonas Traub & Philipp GrulichKeeping Redditors safe in real-time with Flink Stateful FunctionsDistributed Processing for Machine Learning Production Pipelines - Altay, Crowe, RokniFlink Forward Berlin 2018 Highlights