Maciej Kula - Speeding up search with locality sensitive hashing
Maciej Kula - Speeding up search with locality sensitive hashing
[EuroPython 2015]
[24 July 2015]
[Bilbao, Euskadi, Spain]
Locality sensitive hashing (LSH) is a technique for reducing complex
data down to a simple hash code. If two hash codes are similar than
the original data is similar. Typically, they are used for speeding up
search and other similarity comparisons.
In this presentation I will discuss two ways of implementing LSH in
python; the first method is completely stateless but only works on
certain forms of data; the second is stateful but does not make any
assumptions about the distribution of the underlying data. I will
conclude the presentation by describing how we apply LSH to search at
Lyst.
Видео Maciej Kula - Speeding up search with locality sensitive hashing канала EuroPython Conference
[EuroPython 2015]
[24 July 2015]
[Bilbao, Euskadi, Spain]
Locality sensitive hashing (LSH) is a technique for reducing complex
data down to a simple hash code. If two hash codes are similar than
the original data is similar. Typically, they are used for speeding up
search and other similarity comparisons.
In this presentation I will discuss two ways of implementing LSH in
python; the first method is completely stateless but only works on
certain forms of data; the second is stateful but does not make any
assumptions about the distribution of the underlying data. I will
conclude the presentation by describing how we apply LSH to search at
Lyst.
Видео Maciej Kula - Speeding up search with locality sensitive hashing канала EuroPython Conference
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
Felix Wick/Florian Wilhelm - How to Setup a new Python ProjectRuud van der Ham - Simulation of logistic systems in Python with salabimFernando Masanori Ashikaga/Paola Katherine Pacheco/Kátia Nakamura - import communitySebastian Roll - The Dangers of Outsourcing Software DevelopmentFergal Walsh - Rethinking how we build HTTP APIsMoshe Goldstein/david dayan - Implementing Parallel Programming Design Patterns using EFL for PythonJoe Drumgoole - Introduction to Python and MongoDBOlivier Grisel - Histogram-based Gradient Boosting in scikit-learn 0.21Olga Lyashevska - How can machine learning help to predict changes in size of Atlantic herring ?Marko Samastur - Publish your code so others can use it in 5 easy stepsAnna Veronika Dorogush - CatBoost - the new generation of Gradient BoostingMiloslav Pojman - HTTP/3 – Why should I care?Bruno Paixão - Digital Transformation in the fight against CoronavirusMax Humber - How to read (code)Radosław Jankiewicz - Writing quality codeHynek Schlawack - Maintaining a Python Project When It’s Not Your JobIvana Kellyerova - import bacon 🥓Mark Smith - Pythonic Refactoring: Protecting Your Users From Changevincent warmerdam - The Joy of Simulation: for Fun and ProfitJeffrey Mew - Supercharge your Data Science workflow with Notebooks, VS Code, and AzureBojan Miletic - Django Testing on Steroid: pytest + Hypothesis