Network analysis in journalism: Practices and possibilities
Author:
Jonathan Stray, School of Journalism, Columbia University
Abstract:
Network analysis has been proposed and taught as an enabling technology for computational journalism. We report on a set of 34 stories involving network analysis, including three in-depth case studies and a breakdown of how networks were obtained, analyzed, and presented. Networks were commonly extracted from unstructured documents like public records. Visualization is the main analysis method and a popular way to present results. Algorithmic techniques such as centrality methods and community detection are known to journalists but seldom used, in part because the results are hard to interpret and contextualize. On the cutting edge of investigative journalism, graph databases are emerging as a powerful and flexible technology suitable for fusion of diverse data sets. However, these databases cannot be directly visualized because they are dirty and overwhelmingly large. Based on journalist interviews and the experience of previous stories, we explore the long practice of making network "sketches" or "maps." These are not so much data visualizations but conceptual tools that investigators use to think through a story. Maintaining a mapping between these sketches and the underlying graph database is a major design challenge. We propose a system that integrates a deep graph data store with interactive record linkage and network sketching.
More on http://www.kdd.org/kdd2017/
KDD2017 Conference is published on http://videolectures.net/
Видео Network analysis in journalism: Practices and possibilities канала KDD2017 video
Jonathan Stray, School of Journalism, Columbia University
Abstract:
Network analysis has been proposed and taught as an enabling technology for computational journalism. We report on a set of 34 stories involving network analysis, including three in-depth case studies and a breakdown of how networks were obtained, analyzed, and presented. Networks were commonly extracted from unstructured documents like public records. Visualization is the main analysis method and a popular way to present results. Algorithmic techniques such as centrality methods and community detection are known to journalists but seldom used, in part because the results are hard to interpret and contextualize. On the cutting edge of investigative journalism, graph databases are emerging as a powerful and flexible technology suitable for fusion of diverse data sets. However, these databases cannot be directly visualized because they are dirty and overwhelmingly large. Based on journalist interviews and the experience of previous stories, we explore the long practice of making network "sketches" or "maps." These are not so much data visualizations but conceptual tools that investigators use to think through a story. Maintaining a mapping between these sketches and the underlying graph database is a major design challenge. We propose a system that integrates a deep graph data store with interactive record linkage and network sketching.
More on http://www.kdd.org/kdd2017/
KDD2017 Conference is published on http://videolectures.net/
Видео Network analysis in journalism: Practices and possibilities канала KDD2017 video
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![Designing AI at Scale to Power Everyday Life](https://i.ytimg.com/vi/7dn3VNkopfQ/default.jpg)
![Estimation of Recent Ancestral Origins of Individuals on a Large Scale](https://i.ytimg.com/vi/Gjy82uOnnho/default.jpg)
![struc2vec: Learning Node Representations from Structural Identity](https://i.ytimg.com/vi/lu0xMOO48Xo/default.jpg)
![Learning to Generate Rock Descriptions from Multivariate Well Logs with Hierarchical Attention](https://i.ytimg.com/vi/DXXWolWbbzY/default.jpg)
![A Local Algorithm for StructurePreserving Graph Cut](https://i.ytimg.com/vi/l5PRRHuhUQU/default.jpg)
![EmbedJoin: Efficient Edit Similarity Joins via Embeddings](https://i.ytimg.com/vi/QT_hwmQy_DU/default.jpg)
![Multi-Aspect Streaming Tensor Completion](https://i.ytimg.com/vi/PNC2_Kg8Qvo/default.jpg)
![Planning Bike Lanes based on SharingBikes' Trajectories](https://i.ytimg.com/vi/K9aUARUrt_Y/default.jpg)
![Tripoles: A New Class of Relationships in Time Series Data](https://i.ytimg.com/vi/LG1_JO-tsU4/default.jpg)
![Industrial Machine Learning](https://i.ytimg.com/vi/6u-Mz-RDUGg/default.jpg)
![Optimal Reserve Price for Online Ads Trading Based on Inventory Identification](https://i.ytimg.com/vi/JitmXoTVbjE/default.jpg)
![Multi-Aspect Streaming Tensor Completion](https://i.ytimg.com/vi/g8ZbBp8xk6U/default.jpg)
![Dispatch with Confidence: Integration of machine learning](https://i.ytimg.com/vi/lj95WCwgmoY/default.jpg)
![Learning certifiably optimal rule lists for categorical data](https://i.ytimg.com/vi/ebJHnDLLTKA/default.jpg)
![Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking](https://i.ytimg.com/vi/KIP2N5HZRW8/default.jpg)
![HinDroid: An Intelligent Android Malware Detection System](https://i.ytimg.com/vi/pQbExw-gN7k/default.jpg)
![Revisiting power-law distributions in spectra of real world networks](https://i.ytimg.com/vi/4W8lNRpcbiQ/default.jpg)
![KDD Business Lunch](https://i.ytimg.com/vi/_poHfHuTjR4/default.jpg)
![Traffic sensor health monitoring using spatiotemporal graphical modeling](https://i.ytimg.com/vi/nLxE0lbp4NQ/default.jpg)
![A Practical Algorithm for Solving the Incoherence Problem of Topic Models In Industrial Applications](https://i.ytimg.com/vi/5zoPXPbI1Jw/default.jpg)
![Detecting Network Effects: Randomizing Over Randomized Experiments](https://i.ytimg.com/vi/1v5_CzdRVAc/default.jpg)