were you guys able to finish running the benchmark with mistral and got a 70% sc...

themanmaran · 2025-03-07T07:27:18 1741332438

Yup, surprising results! We were able to dig in a bit more. Main culprit is the overzealous "image extraction". Where if Mistral classifies something as an image, it will replace the entire section with (image)[image_002).

And it happened with a lot of full documents as well. Ex: most receipts got classified as images, and so it didn't extract any text.

cdolan · 2025-03-07T14:04:15 1741356255

This sounds like a real problem and hurdle for North American (US/CAN in particular) invoice and receipt processing?

lingjiekong · 2025-03-07T11:47:03 1741348023

where do you find this regarding "Where if Mistral classifies something as an image, it will replace the entire section with (image)[image_002)."?

culi · 2025-03-07T15:35:55 1741361755

themanmaran works at Omni so presumably they have access to the actual resulting data from this study