BigQuery Tips: Nested and Repeated Fields and How GOJEK Builds Data Warehouses (Cloud Next '19)
Gain expert advice on BigQuery to take advantage of data warehousing. In this session, GOJEK, a global company with multiple products and services and lots of data, will share how they have designed their data model to have BigQuery and Google Cloud Storage for their data lake and data warehouse, how they've set up their infrastructure using GCP, and how they've built internal tools as part of this process. GOJEK will also share how they govern their data warehouse to support their business, accelerate their decision making process, and gain intelligence about their products.
You will also get an in-depth walk through on how to work with semi-structured data, including how to ingest JSON array data types inside of BigQuery. But while denormalizing your schema into a single table with nested and repeated fields can yield performance improvements, the SQL syntax for working with array data can be tricky. Follow along as we demo loading, querying, troubleshooting, and unnesting various semi-structured data sets.
Build with Google Cloud → https://bit.ly/2TWjsxq
Watch more:
Next '19 Data Analytics Sessions here → https://bit.ly/Next19DataAnalytics
Next ‘19 All Sessions playlist → https://bit.ly/Next19AllSessions
Subscribe to the GCP Channel → https://bit.ly/GCloudPlatform
Speaker(s): Evan Jones, Gaurav Anand, Zaki Indra Sukma
Session ID: DA205
product:BigQuery,Cloud for Marketing; fullname:Gaurav Anand,Evan Jones;
Видео BigQuery Tips: Nested and Repeated Fields and How GOJEK Builds Data Warehouses (Cloud Next '19) канала Google Cloud Tech
You will also get an in-depth walk through on how to work with semi-structured data, including how to ingest JSON array data types inside of BigQuery. But while denormalizing your schema into a single table with nested and repeated fields can yield performance improvements, the SQL syntax for working with array data can be tricky. Follow along as we demo loading, querying, troubleshooting, and unnesting various semi-structured data sets.
Build with Google Cloud → https://bit.ly/2TWjsxq
Watch more:
Next '19 Data Analytics Sessions here → https://bit.ly/Next19DataAnalytics
Next ‘19 All Sessions playlist → https://bit.ly/Next19AllSessions
Subscribe to the GCP Channel → https://bit.ly/GCloudPlatform
Speaker(s): Evan Jones, Gaurav Anand, Zaki Indra Sukma
Session ID: DA205
product:BigQuery,Cloud for Marketing; fullname:Gaurav Anand,Evan Jones;
Видео BigQuery Tips: Nested and Repeated Fields and How GOJEK Builds Data Warehouses (Cloud Next '19) канала Google Cloud Tech
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
BigQuery Nested and Repeated Fields: Dig Deeper into Data (Cloud Next '18)Data Warehousing With BigQuery: Best Practices (Cloud Next '19)Data Modeling for BigQuery (Google Cloud Next '17)HSBC Invents New Technology as They Migrate to BigQuery (Cloud Next '19)Big Query Live Training - A Deep Dive into Data PipeliningPartitioning and Clustering with BigQueryCloud Data Warehouse Benchmark Redshift vs Snowflake vs BigQuery | FivetranBest practices from experts to maximize BigQuery performance (featuring Twitter)Unlocking the Power of Google BigQuery (Cloud Next '19)A Modern Data Pipeline in Action (Cloud Next '18)Unnesting RECORD arrays in BigQuery SQLBigQuery Basics - Understand Columnar Storage format featureThe Migration Chronicles: CBSi Moves from Teradata and Hadoop to BigQuery (Cloud Next '19)Best Practices for GCE Enterprise Deployments (Cloud Next '19)Advanced BigQuery features: keys to the cloud datawarehouse of the future (Google Cloud Next '17)Google Cloud Summit Seoul '19 KeynoteBig Query For Marketers (join data, unnesting record arrays, GDS) - Part 3How to build and deploy a recommendation system with BigQuery MLUsing PySpark on Dataproc Hadoop Cluster to process large CSV fileProtecting Sensitive Data in Huge Datasets (Cloud Next '19)