Mastodon discussion Discussions May 8 4 views

Is there a FOSS model that does #OCR using #VLM #LLM machine vision learning tricks to get text of tricky docs where #te...

by adingbatponder :nixos: 👾

Is there a FOSS model that does #OCR using #VLM #LLM machine vision learning tricks to get text of tricky docs where #tesseract fails? Using #apitokens to get #chatgpt & co to do it is stomach-turning....

Read Original

Metadata

Account: adingbatponder@fosstodon.org

Mastodon discussion 1m ago

Your thinner competitor's page ranks higher in ChatGPT—not because of the writing, but because RAG pipelines don't chunk...

Your thinner competitor's page ranks higher in ChatGPT—not because of the writing, but because RAG pipelines don't chunk WYSIWYG fields. They chunk semantic units. Headings define ...

Mastodon discussion 3m ago

Japan must develop its own strategy to strengthen submarine cable resilience by balancing security and infrastructure co...

Japan must develop its own strategy to strengthen submarine cable resilience by balancing security and infrastructure cooperation with private technology firms as global competitio...

Mastodon discussion 3m ago

🤖 AI helps read papyrus scroll burnt to crisp during Vesuvius eruptionPreviously hidden text revealed without unrolling ...

🤖 AI helps read papyrus scroll burnt to crisp during Vesuvius eruptionPreviously hidden text revealed without unrolling scroll discusses stoic philosophy on ethics, art and human b...

Is there a FOSS model that does #OCR using #VLM #LLM machine vision learning tricks to get text of tricky docs where #te...

Metadata

Related

Your thinner competitor's page ranks higher in ChatGPT—not because of the writing, but because RAG pipelines don't chunk...

Japan must develop its own strategy to strengthen submarine cable resilience by balancing security and infrastructure co...

🤖 AI helps read papyrus scroll burnt to crisp during Vesuvius eruptionPreviously hidden text revealed without unrolling ...