[HN Gopher] TextSnatcher: Copy text from images, for the Linux D...
___________________________________________________________________
TextSnatcher: Copy text from images, for the Linux Desktop
Author : nateb2022
Score : 331 points
Date : 2024-03-15 02:57 UTC (20 hours ago)
(HTM) web link (github.com)
(TXT) w3m dump (github.com)
| Dibby053 wrote:
| A while back I copied from somewhere this script that does the
| job nicely. #!/bin/bash # Dependencies:
| tesseract-ocr imagemagick scrot xsel IMG=`mktemp`
| trap "rm $IMG*" EXIT scrot -s $IMG.png -q 100 #
| increase image quality with option -q from default 75 to 100
| mogrify -modulate 100,0 -resize 400% $IMG.png #should
| increase detection rate tesseract $IMG.png $IMG &>
| /dev/null cat $IMG.txt | xsel -bi notify-send "Text
| copied" "$(cat $IMG.txt)" exit
| dsp_person wrote:
| I was using something like this for awhile, but I found
| tesseract did poorly quite often. That resize trick didn't seem
| to affect much. I'm not sure what pre-processing would make it
| better.
|
| I'd love to if TextSnatcher does anything to improve on this.
| The github page is opaque.
| mappu wrote:
| The source is pretty straightforward - it's calling `scrot -s
| -o` to a temp file, and then `tessaract` with no further
| preprocessing.
|
| https://github.com/RajSolai/TextSnatcher/blob/master/src/ser.
| ..
| stevesimmons wrote:
| > I found tesseract did poorly quite often
|
| The script calls Tesseract in default page segmentation mode
| (PSM 3). [1]
|
| Depending on the input text, PSM mode 11 for disconnected
| text would probably work much better. That uses the flag "--
| psm 11".
|
| [1] From the original repo: string tess_command = "tesseract
| " + file_path + " " + out_path + @" -l $lang" ;
| aidenn0 wrote:
| Having used Tesseract for OCR for other things, getting the
| right PSM _helps_ but it 's still rather terrible,
| especially for sans-serif fonts, which are common in UIs.
|
| Granted there's a lot of ambiguity in sans serif fonts,
| lower-case "L", vertical bar, and upper-case "i" can even
| be pixel-identical, but I've seen tesseract turn
| Chapter III
|
| into Chapter |l1
|
| which really surprises me. In fact, for books, I run it
| through sed to replace vertical bar with upper-case "i" and
| it significantly improved recognition.
| doix wrote:
| This is perfect for me! Having a window with a button that I
| need to click is much worse than just binding a script to a
| hotkey.
| grimgrin wrote:
| In the spirit of sharing, cuz I think this is a great script
| (thank you), I prefer using maim over scrot simply because it
| has a --nodrag option. Personally feels better when making
| selections from a trackpad. Click once, move cursor, click
| again. maim -s --nodrag --quality=10 $IMG.png
|
| 10 is scrot's 100
| raphman wrote:
| Yet another variation I have been using for ages, using
| ImageMagick's `import` tool (which probably only works on
| X11) import "$tempfile"
| TEXT=`tesseract -l eng+deu "$tempfile" stdout` echo
| "$TEXT" | xsel -i -b
| hiAndrewQuinn wrote:
| I had a PowerShell script which did this as well, but alas, it
| was lost to time with the rest of my little scripts from my
| last job.
|
| Apologies to all of my fellow Unix-Windows borderers.
| Arch-TK wrote:
| trap "rm $IMG*" EXIT
|
| see https://www.shellcheck.net/wiki/SC2064
|
| also, use mktemp -d and recursively delete the directory
| cfiggers wrote:
| For my fellow Windows-using plebians, the official Microsoft
| PowerToys add-in [0] has a feature that does this (it's also been
| added to the stock screenshot tool, but I personally find the one
| keyboard shortcut in PowerToys more pleasant to use).
|
| [0] https://github.com/microsoft/PowerToys
| fredzel wrote:
| Snipping tool build in OCR works for multiple languages
| (English, Russian, Chinese, Japanese etc.) without the need to
| install any language OCR packs though
| lysp wrote:
| Inbuilt snip tool does that too.
|
| WIN+SHIFT+S
|
| If it doesn't have the "Text actions" icon (dashed square with
| paragraph lines in it), you can update it via windows store to
| get the latest version.
| schappim wrote:
| There is a utility available for macOS that extends beyond simply
| opening a document in Preview and attempting to select the text:
| https://github.com/schappim/macOCR
|
| I like the author.
| lelandfe wrote:
| FWIW one can skip Preview and just do Cmd-Shift-3, click the
| thumbnail, and interact with the text in the quicklook. Then,
| delete the image (trashcan in top right). Cmd-A works, too.
| Here's me using it on that comment: https://imgur.com/a/q0NvcS6
| helsinkiandrew wrote:
| Thank You!
| vmoore wrote:
| > for the Linux Desktop
|
| Caveat: This is a Flatpak and not all Linux distros ship with
| Flatpak. But I'll give it a whirl in my Fedora virtual machine.
| I've seen many flavors of this type of tool floating about, most
| of them leveraging Tesseract[0], and I've tried a few of them. It
| fails badly on grainy / noisy images or where the text is warped
| or skewed in some way. It will _not_ solve CAPTCHAs for you!
|
| [0] https://tesseract-ocr.github.io/tessdoc/Home.html
| Retr0id wrote:
| Which distros does flatpak not work on?
| adhamsalama wrote:
| Hannah Montana.
| PlutoIsAPlanet wrote:
| I'm sure it would work if you built it from source.
| SushiHippie wrote:
| https://flatpak.org/setup/
|
| Flatpak should work on every distro. However, it may not be
| included by default, so you need to install flatpak before
| installing this application.
| PlutoIsAPlanet wrote:
| Not really a Caveat, if it only had an Deb you could argue it
| doesn't work on non-Ubuntu/Debian, which is far bigger caveat.
| akho wrote:
| ...it's a pile of Vala code. What you probably mean is that the
| author did not make a package for your distribution, and there
| is no one else who had the time and inspiration to package it.
| You can be the maintainer you seek.
| Zambyte wrote:
| I wrote a script a while back gluing wofi (for dispatching
| several screenshot related tasks), grim, and tesseract together.
|
| https://robbyzambito.me/posts/tips-and-tricks-for-taking-scr...
| yjftsjthsd-h wrote:
| Suggestion: You could just mkdir -p
| ~/Pictures/Screenshots
|
| and not have to warn the user to create it.
| xiwenc wrote:
| I got a similar solution on iOS as a shortcut connected to the
| action button. Some apps doesn't allow easy text copy. Or when
| it's in a foreign language. It does:
|
| - take screenshot
|
| - extract text from it
|
| - translate the text to english. Auto detects source language
|
| - show both original and translated text in quick view where you
| can select and copy if desired.
|
| Here is a implementation you can try:
|
| https://www.icloud.com/shortcuts/f420d24e4960415da1a43f230ab...
|
| While on the subject of iOS. In recent versions when you open a
| photo in the photos app you can also select the text in the photo
| by hand and copy it.
| sunnybeetroot wrote:
| This is fantastic thanks for sharing. I have used it in the
| share sheet when tapping share on an image and it works but
| given I am already providing the image, the screenshot is
| redundant.
| rounakdatta wrote:
| Interesting, I've always resorted to using Google Lens via the
| phone for this purpose. And then using the "Copy to another
| device" feature of Chrome.
| osbkca wrote:
| A great tool. But it only use on Linux. I found Xclippy
| (https://xclippy.com/) tool. It available on Windows and MacOs
| crooked-v wrote:
| Note that this is alreay a (non-obvious) built-in feature on
| Mac and iPhone, called "Live Text". See these articles for
| examples:
|
| https://support.apple.com/guide/preview/interact-with-text-i...
|
| https://support.apple.com/guide/photos/interact-with-text-in...
|
| https://support.apple.com/en-us/HT212630
| LudwigNagasena wrote:
| On iPhone you can even search your images by text content.
| dotancohen wrote:
| For completeness sake, Samsung phones with the S-Pen can also
| OCR. That would be, the old Note series and now the S-Ultra
| phones.
| Pokerface777 wrote:
| very bad UI
| gpuhacker wrote:
| Surprises me to see I'm the first comment here to say: I just use
| GPT4 for this. Works perfectly, even for getting the Latex source
| of a formula you only have a screenshot of.
|
| Probably quite the overkill in terms of energy efficiency for
| just image to text, but I only need this like once every two
| weeks or so.
| rvdca wrote:
| The similar app I am using is Frog (https://getfrog.app) with
| great sucesss.
| mathfailure wrote:
| No AppImage, no .deb, not even brew.
| ssernikk wrote:
| It's on nixpkgs under name `gnome-frog` (for nix users)
| bpfrh wrote:
| I use the same script as Dibby053, copied from stackoverflow but
| with some tweaks to work on kde,gnome and wayland as well as x11
| and with some notifications on what state it is in.
|
| I didn't test the x11/wayland check yet, but feel free to use it
| and report back. #!/bin/bash #
| Dependencies: tesseract-ocr imagemagick # on gnome: gnome-
| screenshot # on kde: spectacle # on x11: xsel
| # on wayland: wl-clipboard die(){ notify-send
| "$1" exit 1 } cleanup(){ [[ -n $1 ]] &&
| rm -rf "$1" } SCR_IMG=$(mktemp) || die "failed
| to take screenshot" # shellcheck disable=SC2064
| trap "cleanup '$SCR_IMG'" EXIT notify-send "Select the
| area of the text" if which "spectacle" &> /dev/null
| then spectacle -r -o "$SCR_IMG.png" || die "failed to
| take screenshot" else gnome-screenshot -a -f
| "$SCR_IMG.png" || die "failed to take screenshot" fi
| # increase image quality with option -q from default 75 to 100
| mogrify -modulate 100,0 -resize 400% "$SCR_IMG.png" || die
| "failed to convert image" #should increase detection rate
| tesseract "$SCR_IMG.png" "$SCR_IMG" &> /dev/null || die "failed
| to extract text" if [ "$XDG_SESSION_TYPE" == "wayland" ]
| then wl-copy < "$SCR_IMG.txt" || die "failed to copy text
| to clipboard" else xsel -b -i < "$SCR_IMG.txt" ||
| die "failed to copy text to clipboard" fi notify-send
| "Text extracted" exit
|
| edit:
|
| Formatting
| rjzzleep wrote:
| I like all the error handling, but you could skip the temp
| files if you just pipe it through
| #!/usr/bin/env bash langs=(eng ara fas chi_sim chi_tra
| deu ell fin heb hun jpn kor nld rus tur) lang=$(printf
| '%s\n' "${langs[@]}" | fuzzel -d "$@") grim -g
| "$(slurp)" - | mogrify -modulate 100,0 -resize 400% png:- |
| tesseract -l eng+${lang} - - | wl-copy notify-send
| "Text extracted"
| miduil wrote:
| If you just put `set -o errexit -o pipefail -o nounset` in
| the first line after the shebang your script will have proper
| error-handling as well. Currently if any fails, notify-send
| will still be triggered.
| bpfrh wrote:
| This version looks nice and short, any thoughts on prober
| error reporting to the end user?
|
| My version has more feedback for the user which was important
| because the user was somebody not familiar with linux/bash,
| but even my version "swallows" errors.
| rjzzleep wrote:
| I added the `set pipefile...` suggested below, but I think
| mogrify only fails if the screenshot fails. Tesseract never
| fails if there is a valid input image, so realistically you
| only need one error message for the screenshot generation,
| unless you want to check whether the user misses any of the
| tools.
| tmerse wrote:
| I also used the very same script until I stumbled upon this on
| hn [0]. #!/usr/bin/env bash
| langs=(eng ara fas chi_sim chi_tra deu ell fin heb hun jpn kor
| nld rus tur) lang=$(printf '%s\n' "${langs[@]}" | dmenu
| "$@") maim -us | tesseract --dpi 145 -l eng+${lang} - -
| | xsel -bi
|
| [0]: https://news.ycombinator.com/item?id=33704483#33705272
| tmerse wrote:
| Ah just saw rjzzleep posted an updated version here. Happy to
| steal this one again :)
| begueradj wrote:
| Looks nice
| Arch-TK wrote:
| # shellcheck disable=SC2064 trap "cleanup '$SCR_IMG'"
| EXIT
|
| While shellcheck can have false positives, and SCR_IMG probably
| doesn't have any characters which need escaping, it's not
| exactly wrong in this case.
|
| The command passed to `trap` is evaluated normally, so variable
| expansions do take place. trap 'cleanup
| "$SCR_IMG"' EXIT
|
| Will behave correctly, and the expansion of SCR_IMG won't be
| susceptible to issues relating to unquoted shell characters.
|
| Alternatively, if you're using a modern bash (this probably
| won't work on a mac by default), then this is an option too:
| trap "cleanup ${SCR_IMG@Q}" EXIT
| bpfrh wrote:
| thanks for fixing and explaining that, I thought '' would
| work and forgot about escaping characters.
| jonquark wrote:
| The Wayland leg works fine for me on gnome+wayland.
| bpfrh wrote:
| thanks!
| guipsp wrote:
| I slightly modified your script to: 1. Clean up properly 2. Run
| spectacle in BG mode, so the window does not pop up after
| screenshotting. #!/bin/bash #
| Dependencies: tesseract-ocr imagemagick # on gnome:
| gnome-screenshot # on kde: spectacle # on x11:
| xsel # on wayland: wl-clipboard die(){
| notify-send "$1" exit 1 } cleanup(){
| [[ -n $1 ]] && rm -r "$1" } SCR_IMG=$(mktemp
| -d) || die "failed to take screenshot" #
| shellcheck disable=SC2064 trap "cleanup '$SCR_IMG'" EXIT
| #notify-send "Select the area of the text" if which
| "spectacle" &> /dev/null then spectacle -b -r -o
| "$SCR_IMG/scr.png" || die "failed to take screenshot"
| else gnome-screenshot -a -f "$SCR_IMG/scr.png" || die
| "failed to take screenshot" fi # increase
| image quality with option -q from default 75 to 100
| mogrify -modulate 100,0 -resize 400% "$SCR_IMG/scr.png" || die
| "failed to convert image" #should increase detection rate
| tesseract "$SCR_IMG/scr.png" "$SCR_IMG/scr" &> /dev/null || die
| "failed to extract text" if [ "$XDG_SESSION_TYPE" ==
| "wayland" ] then wl-copy < "$SCR_IMG/scr.txt" ||
| die "failed to copy text to clipboard" else xsel
| -b -i < "$SCR_IMG/scr.txt" || die "failed to copy text to
| clipboard" fi notify-send "Text extracted"
| exit
| bpfrh wrote:
| Good catch with spectacle, I thought I fixed that already.
|
| Why did you remove the -f parameter?
| palmy wrote:
| This is great!
|
| Also made some minor modifications: replaced `xsel` with
| `xclip` and added truncated version of the copied text to the
| `notify-send`: #!/bin/bash #
| Dependencies: tesseract-ocr imagemagick # on gnome:
| gnome-screenshot # on kde: spectacle # on x11:
| xsel # on wayland: wl-clipboard die(){
| notify-send "$1" exit 1 } cleanup(){
| [[ -n $1 ]] && rm -r "$1" } SCR_IMG=$(mktemp
| -d) || die "failed to take screenshot" #
| shellcheck disable=SC2064 trap "cleanup '$SCR_IMG'"
| EXIT #notify-send "Select the area of the text"
| if which "spectacle" &> /dev/null then
| spectacle -n -b -r -o "$SCR_IMG/scr.png" || die "failed to
| take screenshot" else gnome-screenshot -a -f
| "$SCR_IMG/scr.png" || die "failed to take screenshot"
| fi # increase image quality with option -q from
| default 75 to 100 mogrify -modulate 100,0 -resize 400%
| "$SCR_IMG/scr.png" || die "failed to convert image"
| #should increase detection rate tesseract
| "$SCR_IMG/scr.png" "$SCR_IMG/scr" &> /dev/null || die "failed
| to extract text" if [ "$XDG_SESSION_TYPE" == "wayland"
| ] then wl-copy < "$SCR_IMG/scr.txt" || die
| "failed to copy text to clipboard" else # xsel
| -b -i < "$SCR_IMG/scr.txt" || die "failed to copy text to
| clipboard" xclip -selection clipboard -i <
| "$SCR_IMG/scr.txt" || die "failed to copy text to clipboard"
| fi # Notify the user what was copied but truncate the
| text to 100 characters notify-send "Text extracted from
| image" "$(head -c 100 "$SCR_IMG/scr.txt")" || die "failed to
| send notification" exit
| gerardnico wrote:
| Within Google photos mobile on iOS, you got an ocr.
|
| I take a photo, grab the text and send it via WhatsApp web app.
|
| Not easy as a clip to clipboard but I haven't found any on
| windows.
| zuhsetaqi wrote:
| On iOS it's system wide. It's an iOS and also a macOS feature
| noselasd wrote:
| Another neat thing is when you copy the text on your iOS
| device, it appears in your clipboard on Mac, so you can just
| paste it. (Assuming both devices are on the same wifi/local
| network.
| walteweiss wrote:
| And you need to select the check box somewhere in the
| settings app for it to work.
|
| Source: helped a friend with the feature recently, he
| didn't know it exists.
| talhah wrote:
| If you're on a Pixel 7 and upwards or the latest Samsung phones
| there's also circle to search by holding the home button down.
| The OCR works quite well including English, Russian, Arabic,
| Japanese and I'm sure it works on other languages too.
|
| If you're on Android 14 you can also copy text through the
| recents/overview menu simply by highlighting the text. And
| finally there's Google Photos if you don't have any of these
| features.
|
| There's also Google lens if you're trying to copy text that
| isn't on your screen.
| passion__desire wrote:
| You can take a screenshot. Open that image file with Chrome and
| then do "Search Images with Google" . There you can grab the
| text.
| usr1106 wrote:
| I see tesseract mentioned more and more.
|
| Myself I tried it probably 10-15 years ago on scanned scientific
| papers (decent scanning quality). The results were disappointing.
| The manual postprocessing required was not much less than typing
| it directly. So tesseract became a synonym of "not worth trying"
| to me.
|
| Maybe things have improved over the years, so I should give it a
| new try. (No particular use case at the moment, but those tend to
| appear occasionally.)
| mellutussa wrote:
| Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract
| behind the scenes and it absolutely brilliant.
| walteweiss wrote:
| First time I used it 3 to 4 years ago, it was good.
| mkl wrote:
| It's way better now. I used it 15 years ago and had to do quite
| a bit of preprocessing to get not-entirely-terrible results,
| but now I use it with great success and no preprocessing.
| graynk wrote:
| It's good now _if_ you OCR only scanned documents or otherwise
| have a lot of control over how you prepare the images before
| it's OCR'ed. For more general purpose recognition with weird
| fonts and bad image quality EasyOCR gave me much better results
| sp332 wrote:
| This project is including Tesseract 4.1.1 which is at least a
| couple years old.
| littlestymaar wrote:
| Why is this posted now? The repo has seen no activity in the past
| two years, and the https certificate on the website is also
| obsolete since 2022 so I'm not sure it is still alive...
| Pokerface777 wrote:
| there's probably a lot of software that wasn't updated in the
| last 10 years that could still be really useful
| littlestymaar wrote:
| Sure, but at the same time if it's not included on
| distributions and not updated by upstream it's likely to have
| compatibility issues relatively quickly (GTK is particularly
| bad at maintaining compatibility between versions, even point
| releases).
|
| Also code not being updated is something, TLS certificate not
| being renewed is in an other league in terms of lack of
| support of the project.
| Pokerface777 wrote:
| fair points... but I feel like that it is problems that
| needs to be solved. Emulators and VMs do fix some of those.
| ddtaylor wrote:
| It's a shame this is for elementaryOS as those apps typically do
| not work correctly on other basic Gnome systems.
| Dwedit wrote:
| How is OCR these days? Lately I'm seeing more deeplearning-based
| OCR, and it gives you significantly different results just by
| cropping the image differently.
| jbverschoor wrote:
| Preview on macOS does it automatically. No tools needed.
| carlesfe wrote:
| Preview is one of the most underrated apps in macOS, and the
| one I miss the most when I use Linux or Windows. It's a great
| little toolbox for quick editing and convenience features.
| jjice wrote:
| I didn't realize it was underrated, but it's probably the
| best MacOS bundled software. If I could get a Linux
| equivalent, that would be fantastic. Viewing, editing, PDF
| filling, PDF signing (so useful), all in a fast and
| responsive tool is just incredible.
|
| If anyone has anything near a Linux equivalent, please let
| me know.
| askvictor wrote:
| Would be great if flameshot had this feature. It's otherwise the
| best screenshot tool I've ever come across
| jchw wrote:
| I gave it a try. Works pretty good.
|
| Being a Flatpak app, it will require desktop portals to fully
| work. That said, it worked absolutely fine out of the box for me
| with my existing xdg-desktop-portal-wlr setup. So, it should work
| fine in any X11 or Wayland setup where you have an xdg-desktop-
| portal setup that supports the Screenshot API.
|
| The results are mixed, but not bad by any means. Cleanly readable
| text comes out mostly fine with maybe only whitespace issues and
| the occasional error, which makes this still potentially very
| useful for copying text out of error dialogs and whatnot.
| (Though, I've found that on Linux, error dialogs are far more
| likely to have selectable text in the first place. And on
| Windows, standard MessageBox responds to Ctrl+C.)
| imhoguy wrote:
| Is there anything that could handle indentation? I use very
| similar tool on Linux already (also available on Windows and
| Mac): https://dynobo.github.io/normcap/
| igtztorrero wrote:
| Vala , first time I hear about it !!!
| askl wrote:
| I'm using normcap[1] for this. The workflow feels a bit more
| polished (Though also not perfect) and the repo is still active.
|
| [1] https://github.com/dynobo/normcap
| nathansherburn wrote:
| Just three hours ago I switched back to Linux after a few years
| on MacOS. The only thing missing was the amazing text copy tool I
| was using, "Rex" [1]. What a coincidence to see this post on the
| front page a few hours later!
|
| Side note, what a breath of fresh air Gnome on Fedora is!
|
| [1] https://github.com/amebalabs/TRex
| fnord77 wrote:
| On a side note, sonoma and ios have this functionality built in
| now.
| nico wrote:
| I'm running Monterey and have the feature, but it's only
| inside Preview, which means I need to either open the image
| in Preview or take a screenshot and then open a new image in
| Preview to paste the screenshot, before getting the text
| throwaway290 wrote:
| should work in quicklook and safari at least since Ventura.
| Does for me
| rhettbull wrote:
| Check out Textinator [0] which is an open source macOS app
| that watches for screenshots and automatically does text
| detection then copies text to clipboard. (Disclaimer: I'm
| the author). It works on macOS 10.15+
|
| [0]: https://github.com/RhetTbull/textinator
| freedomben wrote:
| It's obviously personal opinion, but I think you made the best
| choice! (Gnome on Fedora). Welcome back!
|
| It's remarkable how much more polished Gnome is from a few
| years ago. If you use 2FA TOTP, make sure to install Gnome
| Authenticator if you haven't already. If you use Aegis on
| Android (or a handful of other formats) it can import/export
| your seeds. It is downright luxurious having this on my
| laptop/desktop: # If you haven't setup
| flathub yet sudo flatpak remote-add --if-not-exists
| flathub https://flathub.org/repo/flathub.flatpakrepo
| # Install Authenticator from flathub. Source:
| https://gitlab.gnome.org/World/Authenticator sudo
| flatpak install flathub com.belmoussaoui.Authenticator
| BHSPitMonkey wrote:
| You can also add TOTP secrets to entries in KeePassXC and
| generate/copy codes there (Ctrl+T).
| dobicinaitis wrote:
| Another variant of the scripts floating around that I've been
| using to scratch the same itch: #!/bin/bash
| # Performs Optical Character Recognition (OCR) on a freely chosen
| # screen area and copies the recognized text to the clipboard.
| # # Dependencies: sudo apt install gnome-screenshot
| tesseract-ocr xclip IMAGE_FILE="/tmp/ocr.png"
| gnome-screenshot --area --file "$IMAGE_FILE" tesseract
| "$IMAGE_FILE" - | xclip -rmlastnl -selection clipboard
| rm -f "$IMAGE_FILE"
| mergy wrote:
| Compiled on Deb 12.5.x - pretty cool. Thank you.
| lacoolj wrote:
| thats pretty cool but definitely has a ways to go (in the example
| on github page even shows a few discrepancies between original
| and pasted text - seems to be mostly punctuation though)
|
| very nice though thanks for sharing!
| sp332 wrote:
| Looks like it hasn't veen updated in a couple years.
| dark-star wrote:
| It seems like this tool sends your screenshot to some sort of web
| service.
|
| If that's really the case then obviously don't use it for
| personal data (invoices, love letters, legal proceedings, ...)
| sp332 wrote:
| Why would it be using Tesseract if it also uses an external
| service? And who's paying for the service?
| jchw wrote:
| I didn't see anything of the sort looking through the source
| code. I see it uses portals (or scrot) to take a screenshot,
| and spawns Tesseract as an external process.
|
| https://github.com/RajSolai/TextSnatcher/blob/9e67760d6c16ea...
|
| Tesseract itself seems to be included in the Flatpak as you'd
| expect:
|
| https://github.com/RajSolai/TextSnatcher/blob/master/manifes...
|
| Where did you get that?
| makach wrote:
| Excellent, just like text extractor in windows powertoys! Love
| this!
| avipars wrote:
| Does this work in languages besides english?
| rhettbull wrote:
| For macOS users, I'm the author of Textinator [0] a similar
| utility for macOS that uses Apple's Vision framework [1] for
| doing the OCR natively. Modern versions of macOS (since Sonoma)
| have a similar ability to copy text from images using the Live
| Text feature [2] but Textinator works on macOS Catalina and later
| and simplifies the "take screenshot, copy text to clipboard"
| workflow. It's also an example of how to build a native macOS app
| entirely in Python.
|
| [0]: https://github.com/RhetTbull/textinator
|
| [1]:
| https://developer.apple.com/documentation/vision?language=ob...
|
| [2]: https://support.apple.com/guide/preview/interact-with-
| text-i...
| NegativeLatency wrote:
| Been using this little script for mac os to copy text out of
| images without going through Preview.app:
| https://github.com/nburns/utilities/blob/main/ocr
|
| (would definitely appreciate feedback/critiques from any swifties
| out there)
___________________________________________________________________
(page generated 2024-03-15 23:00 UTC)