I noticed on the Arabic example they lost a space after the first letter on the ...

resiros · 2025-03-06T19:38:17 1741289897

Arabic speaker here. No, it's perfect.

evmar · 2025-03-06T19:54:27 1741290867

I am pretty sure it added a kasrah not present in the input on the 2nd to last line. (Not saying it's not super impressive, and also that almost certainly is the right word, but I think that still means not quite "perfect"?)

gl-prod · 2025-03-06T19:59:11 1741291151

Yes, it looks like it did add a kasrah to the word ظهري

yoda97 · 2025-03-06T22:09:18 1741298958

Yep, and فمِنا too, this is not just OCR, it made some post-processing corrections or "enhancements". That could be good, but it could also be trouble the 1% chance it makes a mistake in critical documents.

gl-prod · 2025-03-06T19:58:00 1741291080

He means the space between the wāw (و) and the word

evmar · 2025-03-06T20:00:53 1741291253

I added a pic to the original comment, sorry for not being clear!

albatrosstrophy · 2025-03-06T22:00:36 1741298436

And here I thought after reading the headline: finally a reliable Arabic OCR. I've never in my life found a good that does the job decently especially for a scanned document. Or is there something out there I don't know about?