Why LLMs still suck at OCR
21 by ritvikpandey21 | 13 comments on Hacker News.
Document ingestion and the launch of Gemini 2.0 caused a lot of buzz this week. As a team building in this space, this is something we researched thoroughly. Here’s our take: ingestion is a multistep pipeline, and maintaining confidence from LLM nondeterministic outputs over millions of pages is a problem.
21 by ritvikpandey21 | 13 comments on Hacker News.
Document ingestion and the launch of Gemini 2.0 caused a lot of buzz this week. As a team building in this space, this is something we researched thoroughly. Here’s our take: ingestion is a multistep pipeline, and maintaining confidence from LLM nondeterministic outputs over millions of pages is a problem.
Comments
Post a Comment
https://anabizcollection.weeblysite.com/