Загрузка страницы

Large scale Natural Language Processing of biomedical literature (Beam Summit Europe 2019)

Large scale Natural Language Processing of biomedical literature in Python with beam and spacy

We use beam to extract the relations between entities such as genes, drugs, and diseases from biomedical literature and build a knowledge graph from the extracted relations. Using the knowledge graph to match existing drugs to rare diseases, Healx is on a mission to advance 100 rare disease treatments towards the clinic by 2025.

Beam allows us to build a knowledge graph encapsulating these relations at scale. We can process about 30 million PubMed abstracts to build our internal knowledge graph in less than 30 hours. Using Dataflow to run our beam job allows us to quickly scale a large cluster up and down depending on the computational needs. The potential for streaming in documents means we don’t need to rebuild our knowledge graph and can continuously push updates from novel publications. Developing and running beam jobs in Python still has some challenges which I will also talk about.

Speakers:
Christiaan Swart - NLP Engineer @ Healx

The Beam Summit Europe 2019 was a 2 day event held in Berlin at the KulturBrauerei, all focused around Apache Beam.

For more information about the Beam Summit, follow us on twitter @BeamSummit or go to the website: https://beamsummit.org/

Видео Large scale Natural Language Processing of biomedical literature (Beam Summit Europe 2019) канала Apache Beam
Показать
Комментарии отсутствуют
Введите заголовок:

Введите адрес ссылки:

Введите адрес видео с YouTube:

Зарегистрируйтесь или войдите с
Информация о видео
27 июня 2019 г. 1:26:48
00:20:52
Яндекс.Метрика