Загрузка...

Example python app doing OCR with Gemma 3 (12b parameter) running locally on MBP M1

Google DeepMind has been cooking lately, while everyone has been focusing on the Gemini 2.0 Flash native image generation release, Gemma 3 is also a impressive release for developers.

Here's a little app I build in python in a couple of hours with Claude 3.7 in u/cursor_ai showcasing that.
The app uses Streamlit for the UI, Ollama as the backend running Gemma 3 vision locally, PIL for image processing, and pdf2image for PDF support.

What a time to be alive!

https://github.com/adspiceprospice/localOCR

If you want to talk to me about Ai, reach out through ad1x.com, or here or on X https://x.com/adriantamplaru

Видео Example python app doing OCR with Gemma 3 (12b parameter) running locally on MBP M1 канала Adrian Tamplaru
Яндекс.Метрика

На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.

Об использовании CookiesПринять