[HN Gopher] Show HN: Repo2vec - an open-source library for chatt...
___________________________________________________________________
Show HN: Repo2vec - an open-source library for chatting with any
codebase
Hi HN, We're excited to share repo2vec: a simple-to-use, modular
library enabling you to chat with any public or private codebase.
It's like Github Copilot but with the most up-to-date information
about your repo. We made this because sometimes you just want to
learn how a codebase works and how to integrate it, without
spending hours sifting through the code itself. We tried to make
it dead-simple to use. With two scripts, you can index and get a
functional interface for your repo. Every generated response shows
where in the code the context for the answer was pulled from. We
also made it plug-and-play where every component from the
embeddings, to the vector store, to the LLM is completely
customizable. If you want to see a hosted version of the chat
interface with its features, here's a link:
https://www.youtube.com/watch?v=CNVzmqRXUCA We would love your
feedback! - Mihail and Julia
Author : nutellalover
Score : 29 points
Date : 2024-08-28 19:59 UTC (3 hours ago)
(HTM) web link (github.com)
(TXT) w3m dump (github.com)
| cool-RR wrote:
| I want to feed it not only the code but also a corpus of
| questions and answers, e.g. from the discussions page on GitHub.
| Is that possible?
| spaceship__sun wrote:
| I just need to have gemini 1.5 pro in VS code dev environment
| and pass in the entire codebase in the context window. THEY
| STILL HAVEN'T DONE THIS.
| nutellalover wrote:
| Thanks for the request! This is on our roadmap, as is
| supporting Github issues and eventually external
| documentation/code discussions from Slack, Jira/Linear, etc.
| nutellalover wrote:
| Feel free to submit an issue on the repo and we'll get to it!
| RicoElectrico wrote:
| I wonder if it will work on
| https://github.com/organicmaps/organicmaps
|
| So far two similar solutions I tested crapped out on non-ASCII
| characters. Because Python's UTF-8 decoder is quite strict about
| it.
| nutellalover wrote:
| OP here! I love this stress test. Will index and get back to
| you!
| resters wrote:
| Very useful! I was just thinking this kind of thing should exist!
|
| I would also like to be able to have the LLM know all of the
| documentation for any dependencies in the same way.
| nutellalover wrote:
| Thanks!
|
| This is a great idea. Definitely something we plan to support.
| ccgongie wrote:
| Super easy to use! Thanks! What's powering this under the hood?
| nutellalover wrote:
| The starter config is Openai embeddings + llm, pinecone vector
| store, gradio for the UI. But it's customizable so you can swap
| out whatever you want easily.
| wiradikusuma wrote:
| Is this for a specific language? Does it support polygot
| (multiple languages in 1 project)?
| nutellalover wrote:
| Yup! We use tree-sitter and parse it at the file-level.
___________________________________________________________________
(page generated 2024-08-28 23:00 UTC)