[HN Gopher] Easy-to-Use Apple Vision wrapper for text extraction...
       ___________________________________________________________________
        
       Easy-to-Use Apple Vision wrapper for text extraction and clustering
        
       Author : brulenaudet
       Score  : 27 points
       Date   : 2024-01-27 11:58 UTC (1 days ago)
        
 (HTM) web link (pypi.org)
 (TXT) w3m dump (pypi.org)
        
       | brulenaudet wrote:
       | Apple-ocr is a utility for Optical Character Recognition (OCR)
       | that facilitates the extraction of text from images. This Python-
       | based tool is designed to help developers, researchers, and
       | enthusiasts in the field of text extraction and clustering. It
       | leverages a combination of various technologies to achieve this,
       | including the Vision framework provided by Apple.
        
       | l33t7332273 wrote:
       | It uses python so I'm a bit concerned about speed. Are there any
       | benchmarks for how fast this is if I want to use it for a real
       | time application?
        
         | thih9 wrote:
         | This is a python wrapper around apple's vision framework. If
         | you're concerned about speed and/or don't want python, then
         | interacting directly with the vision framework is an option.
        
       | fotta wrote:
       | Apple Vision SDK:
       | https://developer.apple.com/documentation/vision
        
       | theolivenbaum wrote:
       | Nice! We maintain one for C# too:
       | https://www.nuget.org/packages/LiveTextSharp
        
       | bomewish wrote:
       | I've been using textra for a wrapper on the Apple vision sdk
       | 
       | https://github.com/freedmand/textra
       | 
       | But this project calls torch and a bunch of other ML libs. So
       | it's not using Apple vision?
        
         | sigmoid10 wrote:
         | It's only using those other libs to do some simplistic
         | statistical analysis of Apple vision's outputs. Further down
         | you can see it just calls VNImageRequestHandler which is
         | basically a call from Python to Apple Vision via ObjC in order
         | to get any actual recognitions.
        
       ___________________________________________________________________
       (page generated 2024-01-28 23:00 UTC)