Example python app doing OCR with Gemma 3 (12b parameter) running locally on MBP M1
Google DeepMind has been cooking lately. While everyone has been focusing on the Gemini 2.0 Flash native image generation release, Gemma 3 is also an impressive release for developers.
Here's a little app I built in Python in a couple of hours with Claude 3.7 in u/cursor_ai showcasing that.
The app uses Streamlit for the UI, Ollama as the backend running Gemma 3 vision locally, PIL for image processing, and pdf2image for PDF support.
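For a rough idea of how those pieces can fit together, here is a minimal sketch. It is not the code from the repo: it talks to Ollama's local REST endpoint directly with the standard library instead of going through Streamlit, and the function names, the prompt text, and the `gemma3:12b` model tag are all assumptions made for illustration.

```python
import base64
import json
import urllib.request
from io import BytesIO

# Assumptions: Ollama is running on its default local port and a
# Gemma 3 vision model has been pulled under the tag below.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL = "gemma3:12b"


def build_ocr_request(image_bytes, prompt="Extract all text from this image."):
    """Build the JSON payload for Ollama's /api/generate endpoint (hypothetical helper)."""
    return {
        "model": MODEL,
        "prompt": prompt,
        # Ollama expects images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        # Ask for a single JSON reply instead of a token stream.
        "stream": False,
    }


def ocr_image(image_bytes):
    """Send one image to the local model and return the text it replies with."""
    payload = json.dumps(build_ocr_request(image_bytes)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


def pdf_to_png_pages(pdf_path):
    """Render each PDF page to PNG bytes using pdf2image (which needs poppler installed)."""
    from pdf2image import convert_from_path  # imported lazily: optional dependency

    pages = []
    for page in convert_from_path(pdf_path):  # each page is a PIL image
        buffer = BytesIO()
        page.save(buffer, format="PNG")
        pages.append(buffer.getvalue())
    return pages
```

With Ollama running, something like `[ocr_image(p) for p in pdf_to_png_pages("scan.pdf")]` would return one block of extracted text per page; `scan.pdf` is a placeholder file name.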
What a time to be alive!
https://github.com/adspiceprospice/localOCR
If you want to talk to me about AI, reach out through ad1x.com, here, or on X: https://x.com/adriantamplaru
Video "Example python app doing OCR with Gemma 3 (12b parameter) running locally on MBP M1" from the channel Adrian Tamplaru
Video information
March 20, 2025, 18:43:42
00:02:06