[HN Gopher] Show HN: Repo2vec - an open-source library for chatt...
       ___________________________________________________________________
        
       Show HN: Repo2vec - an open-source library for chatting with any
       codebase
        
       Hi HN, We're excited to share repo2vec: a simple-to-use, modular
       library enabling you to chat with any public or private codebase.
       It's like Github Copilot but with the most up-to-date information
       about your repo.  We made this because sometimes you just want to
       learn how a codebase works and how to integrate it, without
       spending hours sifting through the code itself.  We tried to make
       it dead-simple to use. With two scripts, you can index and get a
       functional interface for your repo. Every generated response shows
       where in the code the context for the answer was pulled from.  We
       also made it plug-and-play where every component from the
       embeddings, to the vector store, to the LLM is completely
       customizable.  If you want to see a hosted version of the chat
       interface with its features, here's a link:
       https://www.youtube.com/watch?v=CNVzmqRXUCA  We would love your
       feedback!  - Mihail and Julia
        
       Author : nutellalover
       Score  : 29 points
       Date   : 2024-08-28 19:59 UTC (3 hours ago)
        
 (HTM) web link (github.com)
 (TXT) w3m dump (github.com)
        
       | cool-RR wrote:
       | I want to feed it not only the code but also a corpus of
       | questions and answers, e.g. from the discussions page on GitHub.
       | Is that possible?
        
         | spaceship__sun wrote:
         | I just need to have gemini 1.5 pro in VS code dev environment
         | and pass in the entire codebase in the context window. THEY
         | STILL HAVEN'T DONE THIS.
        
         | nutellalover wrote:
         | Thanks for the request! This is on our roadmap, as is
         | supporting Github issues and eventually external
         | documentation/code discussions from Slack, Jira/Linear, etc.
        
           | nutellalover wrote:
           | Feel free to submit an issue on the repo and we'll get to it!
        
       | RicoElectrico wrote:
       | I wonder if it will work on
       | https://github.com/organicmaps/organicmaps
       | 
       | So far two similar solutions I tested crapped out on non-ASCII
       | characters. Because Python's UTF-8 decoder is quite strict about
       | it.
        
         | nutellalover wrote:
         | OP here! I love this stress test. Will index and get back to
         | you!
        
       | resters wrote:
       | Very useful! I was just thinking this kind of thing should exist!
       | 
       | I would also like to be able to have the LLM know all of the
       | documentation for any dependencies in the same way.
        
         | nutellalover wrote:
         | Thanks!
         | 
         | This is a great idea. Definitely something we plan to support.
        
       | ccgongie wrote:
       | Super easy to use! Thanks! What's powering this under the hood?
        
         | nutellalover wrote:
         | The starter config is Openai embeddings + llm, pinecone vector
         | store, gradio for the UI. But it's customizable so you can swap
         | out whatever you want easily.
        
       | wiradikusuma wrote:
       | Is this for a specific language? Does it support polygot
       | (multiple languages in 1 project)?
        
         | nutellalover wrote:
         | Yup! We use tree-sitter and parse it at the file-level.
        
       ___________________________________________________________________
       (page generated 2024-08-28 23:00 UTC)