I’m sure a lot of good answers will come, but I’ll leave my one penny on the question.
In my research I saw a lot about the Tesseract from Google which is one of the most efficient OCR codes there is. But I didn’t develop anything, it was just research.
I did some tests, but without a customization for the specific purpose, which is the resolution of the captcha Recipe, you will have many errors for very few hits.
The latest version of Tesseract
is the 3.02.02, so far.
Until version 2.32, if I’m not mistaken, it was possible to wrapper the Tesseract library which is written in C++ for C and so use more easily by other languages. I think today, like the Tesseractenginewrapper for . Net, it is easier to try to find something that already does her Wrapper for Java, which is your case.
And I came to see also online services that offer to make the decaptcha for you, as the captchabot.com and the deathbycaptcha.com. But I didn’t test them either.
Ah a certain discussion about whether this is cool or not.
There’s a guy who implemented something with these services, he sends the captcha to the API of one of these sites, the site does the decaptcha for it and returns the text, and then he performs the access and works the HTML, see here.
His blog is: http://fsist.blogspot.com.br. Good, but it’s just so you know what’s already been done.
But I would like to encourage you to search by download of Nfe, the XML itself, directly by the Webservice of the Internal Revenue Service, just need to have the certificate.
Most likely this should be done directly by the Nfe api itself. If the note involves a vendor, it is correct to import from XML. If it’s yours or your client’s, you have the data. If it’s third-party, the api lets you see if it’s valid. For the rest, if there is no involvement with either party, captcha is precisely to avoid third party data scanning bots.
– Bacco
Hello Bacco, I fit in the bots option, I am neither the sender nor the recipient, so I can not use the service of SEFAZ, I do the freight and to prevent my team have to enter all the information of NFE manually want to download, because, the volume is huge.
– user3146356
Whoa, look at the active issues: decoder-to-captcha-code-open
– user3628
An alternative would be to import the captcha into your application so your team would only type the captcha. Now, I think the client would send you XML in a good, no?
– Bacco
Many clients do not even know what is xml, however, some already rule, but are the minority.. Thanks Bacco
– user3146356