[HN Gopher] llama-fs: A self-organizing file system with llama 3
___________________________________________________________________
llama-fs: A self-organizing file system with llama 3
Author : archb
Score : 153 points
Date : 2024-05-26 19:00 UTC (4 hours ago)
(HTM) web link (github.com)
(TXT) w3m dump (github.com)
| bberenberg wrote:
| I wonder how well this will pair with a search first UX since
| finding stuff may be hard otherwise.
| dist-epoch wrote:
| Finally hope for our digital mountains. In a few years you'll
| point the AI at it and it will organize it for you.
| poisonborz wrote:
| I think that would be a mental degradation. Organising the data
| is at least as important as knowing what the data is. If you
| don't have a mental map of what you have, you will not know
| what you can search. I don't really mean file contents or dates
| here, that sort of auto-organisation is long possible already.
| noman-land wrote:
| In a few years you'll point the AI at your life and it will
| live it for you.
| ben_w wrote:
| In the Futurama sense?
|
| https://youtu.be/LCPhbN1l024?si=Lsf9Uuors1Dl56Yy
| blooalien wrote:
| Great! That will free me up to view advertisements 24/7! Win-
| win!
| shaan7 wrote:
| I'm more excited for the day we can have robot do that for the
| analog stuff instead. I'd rather sit and enjoy organizing files
| on my computer than loading the washing machine, the
| dishwasher, folding clothes, packing for vacations .. well you
| get the point.
|
| Would be cool if I can build CI pipelines for daily stuff, just
| describe everything in YAML* and not have to do repetitive
| tasks all the time.
|
| * or, hopefully something better that isn't a pain to write
| outside1234 wrote:
| glue + tomatoes = recipe directory
| behnamoh wrote:
| Apple will probably add this feature and more to the stacks
| feature on macOS (a multimodal model would be very useful there).
| Even better: I expect Apple to use ML and local models to scan
| file contents and have them show up in search (e.g., on spotlight
| or Raycast, search for the picture of my latest receipt that I
| saved __somewhere__ I don't remember).
| verdverm wrote:
| A few months ago, while using search in finder, I noticed that
| it would return images with the search term in the image. They
| seem to be doing something ML already
| iterateoften wrote:
| It's been like that for years. It does OCR of text, object
| names (like cat) and groups by person / pet, etc.
|
| Most stuff like that doesn't need an LLM and would probably
| decrease utility.
| ravetcofx wrote:
| They Have stacks already which kind of do a similar thing
| https://support.apple.com/en-ca/guide/mac-help/mh35846/14.0/...
| idncsk123 wrote:
| Working on something related, github.com/idncsk/canvas, canvas-
| server, canvas-ui-browser, always use the dev or for browser the
| tarbor branch, waiting on my burger in Budapest sry for the lack
| of details
| jjuliano wrote:
| That's an interesting project and concept.
| jjuliano wrote:
| I build a tool with similar goal a year ago:
| https://github.com/jjuliano/aifiles -- A CLI that manages files
| using AI. It helps me to organize my files and backups.
|
| In my case, I built a local OpenAI-emulated proxy API to run
| against a local LLM, and used a modified OpenAI library to
| connect with it. This was the solution a year ago. Now, it's
| easier to deploy a local LLM.
| pratik_kanthi wrote:
| I would like to see something similar for my browser tabs which
| are always a mess. Unsure what UX considerations are needed.
| Thoughts?
| philippgerard wrote:
| Arc Browser does that already, also for downloaded files. Looks
| neat.
| cpuguy83 wrote:
| +1 for Arc
| ilaksh wrote:
| chrome recently rolled out something that groups browser tabs
| together. I thought they said they used AI. But basically a
| bunch of youtube tabs get consolidated into a youtube button
| tab that toggles the group to expand.
| pratik_kanthi wrote:
| Makes sense. Wouldn't grouping based on the context of what
| i'm browsing be more desirable? If i'm searching a bug fix
| i'll probably be doing it across multiple domains, perhaps a
| tab group based on that.
| qwertox wrote:
| Also with the possibility to look at the current tab and open a
| new window populated with tabs from related pages you've
| visited earlier.
| pama wrote:
| The high level strategy would be to get the browser tab
| contents and then ask a local LLM to organize the tabs based on
| their full content. If you run selenium or some developer
| versions of certain browsers you might be able to source the
| full contents directly, including any state that may not be
| obvious from only the URL. If the url of the tabs is enough
| (for most cases it should be), then there are many options and
| relatively easy implementations possible. Emacs has tools to
| communicate with browsers (though they depend on the local OS
| and some are limited to only certain browsers or certain OSs),
| so if you are happy with controlling the tabs from Emacs, you
| could simply reorganize/regroup the tabs within an Emacs buffer
| with the help of an LLM that gets the url and may open up
| connections to see what the trivially accessible contents aee.
| I would use this and might test this idea when I am not AFK. If
| Emacs is not an option, perhaps find an OS or extension-
| dependent way to reorganize the tabs.
| pratik_kanthi wrote:
| an extension seems the most natural way to do this, but that
| would entail hosting a model which isn't cheap, will give it
| a go
| IncreasePosts wrote:
| Can extensions connect to localhost? IE to a local ollama
| for example.
| delijati wrote:
| Can i run this in a sandbox or dry ... Not that i'm not trusting
| my AI Overlord ;)
| haolez wrote:
| Will this actually move stuff around? I'd prefer that it mounted
| in another directory, giving me an organized view of my files but
| not actually moving them.
| mihalycsaba wrote:
| Exactly this. If it starts moving files around, who knows where
| will they end up.
| bossyTeacher wrote:
| Hopefully not. LLMs hallucinate. It will move some critical
| files somewhere random n levels deep in some node folder you
| will never be able to find
| outofpaper wrote:
| It's perfect for organizing people's /tmp folder
| brulard wrote:
| /dev/null seems like a nice destination
| nestorD wrote:
| A `dry run` mode producing a symbolic link based reorganized
| copy of the folder would be nice.
| pawelduda wrote:
| Hmm or just make it produce symbolic links by default, and
| then if you want, allow you to "commit" changes, which would
| actually move these files. Any downsides to this?
|
| Could also have integrity checks that total number of files
| and their attributes didn't change after the commit
|
| Cool project OP
| spot5010 wrote:
| Yeah, the thought of some LLM based hallucinating piece of
| software moving my files around gives me anxiety.
| accrual wrote:
| This is pretty neat. Can confirm my Downloads folder could use
| some help, there's usually at least one or two nested "Old
| Downloads" or "Sort me" folders.
|
| I think one thing to improve the readme or landing page for this
| project would be a before & after for a sample ~/Downloads
| directory, maybe in `tree` format.
| vessenes wrote:
| So, this terrifies me, in that it is going to rename files and
| move them. I have an inordinate amount of fear about that
| happening in the hands of an LLM. But, I love the idea of a
| virtual filesystem / directory that lets me see things based on
| LLM-led naming. Just, leave my main files alone, oh please god,
| don't touch them.
|
| With virtual systems, I think it could be really interesting, you
| could have a few different types, from conceptual, to project, to
| research area. That would be amazingly cool.
| nicklecompte wrote:
| A lot of people are worried about Llama screwing up, and that's a
| valid concern. But this is also an Electron app + a few
| nontrivial Python scripts for watching changes to a filesystem,
| yet there are _zero_ actual tests. Just some highly
| unrepresentative "sample data."
|
| I am a grumpy AI hater. But Llama is not the security/data risk
| here. I don't think anyone should use this unless they are
| interested in contributing.
| outofpaper wrote:
| Oh come now no need to be grumpy. We need to just accept that
| this is somewhere between managing your files using an
| algorithm that integrates a roulette wheel and a system that
| instead has Russian roulette built in. In either case its going
| to get messy.
| endofreach wrote:
| Interesting. Literally just this week i planned some time in to
| experiment with LLMs as a full FS. This seems not truly fair to
| be named "FS". But cool approach. Though this has been done in
| bash scripts online before. So i only dislike naming it FS. It's
| not an FS. Just like your productivity app is not an OS. Will
| check it out regardless. Congratulations anyway!
| gregwebs wrote:
| I think I would like a tool that intelligently suggested renames
| but doesn't automatically do them.
|
| Arc Browser has an AI rename feature (for downloaded files). I
| tried it out but I had to turn it off. I love Arc Browser BTW and
| their AI hover summary is useful. I found poorly naming of files
| to be disruptive and it's a lot better if I am more involved in
| renaming the file- that will help me remember it.
| olooney wrote:
| That's the approach I took on a similar toy project[IT] I've
| been working on the past week (images instead of text.) It
| first creates a `metadata.csv` file with suggested clean
| filenames and a Boolean flag indicating if it thinks it needs
| to be changed at all. You can manually view and edit the
| `metadata.csv` file and only once you're happy with it do you
| pull the trigger by running `autorename()`. I definitely feel
| like you need a human in the loop for this kind of thing.
|
| [IT]: https://github.com/olooney/image_tagger
| cyanydeez wrote:
| This the type of thing I plan to implement. If there's one way to
| save things. Of course, it's definitely going to hide some files
| eventually, so hopefully you got a log file going documenting the
| modifications.
| caseyy wrote:
| I've written something similar in python -- a script that renames
| documents based on their contents using Ollama. It's very easy to
| write things like that, I recommend anyone interested in local
| LLMs try a fun project such as that.
|
| My first question was why would this be a file system rather than
| an app or a script, but I see it's actually an Electron app and
| python scripts, which I think is the right approach.
|
| I think that something that would have a UI for automatically
| tagging, renaming, and moving files _on request_ but not
| constantly would be very handy. Also, if you could somehow steer
| files into binning directories ( "if X, put it in bin ~/X, if Y,
| put the file in bin ~/Y", "if it's an invoice or deals with
| payment, put it in ~/Documents/Finance"), that would be cool.
| Finally, Windows support would be amazing.
| TylerE wrote:
| Well, I see very good reasons for writing it as an actual file
| system... it allows the rest of the ecosystem to interact with
| it on an equal footing.
| btbuildem wrote:
| > automatically tagging, renaming, and moving files on request
| but not constantly
|
| Key point here -- I might be old, but I like my files to stay
| where/how I left them.
| 3abiton wrote:
| I feel naming this is rather personal as we all think
| differently, and organize files mirroring our way of thinking.
| I find it odd to have an LLM do that. How often do you use it
| and how does it fit withing your workflow?
| saintradon wrote:
| This is something where I can see infinite context windows really
| working. My local llm knowing where tens of thousands of files
| located on my computer are, what they do, which ones can be
| moved, which ones can't etc, how to organize them. Just cleaning
| up my absolutely messy space.
| varenc wrote:
| What makes this a "file system" and not just an application that
| uses llama/AI to do file renaming? I was expecting something like
| a FUSE application that offered an alternative AI-organized view
| into a directory.
___________________________________________________________________
(page generated 2024-05-26 23:01 UTC)