[HN Gopher] Semantic Search with SQLite
       ___________________________________________________________________
        
       Semantic Search with SQLite
        
       Author : txtai
       Score  : 85 points
       Date   : 2022-11-21 14:03 UTC (8 hours ago)
        
 (HTM) web link (neuml.github.io)
 (TXT) w3m dump (neuml.github.io)
        
       | pantsforbirds wrote:
       | Interesting choice of a database. Duckdb seems like it is a much
       | better fit for the type of query being done. Wonder why sqlite
       | was chosen over duckdb.
        
         | txtai wrote:
         | Duckdb is on the roadmap. SQLite is the first implementation.
         | The plan is to extend the interface for other database types,
         | including Duckdb.
        
       | sthatipamala wrote:
       | Any examples of this being combined with full text search to get
       | a hybrid index?
        
         | txtai wrote:
         | Sure thing. Examples can be found here:
         | https://neuml.github.io/txtai/examples/
         | 
         | This one re-ranks the output from an Elasticsearch index -
         | https://colab.research.google.com/github/neuml/txtai/blob/ma...
         | 
         | The next major release have more examples using the local BM25
         | scoring module.
        
       | codazoda wrote:
       | It's not clear to me if txtai reaches out to the internet for all
       | these queries. I assume it does and processes much of this in the
       | cloud. That probably makes it a non-starter for much of my work.
       | I do wonder, however, since their API docs talk a little about
       | cloud options (making me wonder if non-cloud is the default).
       | But, it's not immediately obvious to me.
        
         | txtai wrote:
         | It's all local except for downloading the transformers models
         | for vectorization. TRANSFORMERS_OFFLINE_MODE can be set and
         | models downloaded manually for offline use.
        
       ___________________________________________________________________
       (page generated 2022-11-21 23:01 UTC)