Is there a FOSS model that does #OCR using #VLM #LLM machine vision learning tricks to get text of tricky docs where #te...

Is there a FOSS model that does #OCR using #VLM #LLM machine vision learning tricks to get text of tricky docs where #tesseract fails? Using #apitokens to get #chatgpt & co to do it is stomach-turning....

Read Original

Related