[HN Gopher] llama-fs: A self-organizing file system with llama 3
       ___________________________________________________________________
        
       llama-fs: A self-organizing file system with llama 3
        
       Author : archb
       Score  : 153 points
       Date   : 2024-05-26 19:00 UTC (4 hours ago)
        
 (HTM) web link (github.com)
 (TXT) w3m dump (github.com)
        
       | bberenberg wrote:
       | I wonder how well this will pair with a search first UX since
       | finding stuff may be hard otherwise.
        
       | dist-epoch wrote:
       | Finally hope for our digital mountains. In a few years you'll
       | point the AI at it and it will organize it for you.
        
         | poisonborz wrote:
         | I think that would be a mental degradation. Organising the data
         | is at least as important as knowing what the data is. If you
         | don't have a mental map of what you have, you will not know
         | what you can search. I don't really mean file contents or dates
         | here, that sort of auto-organisation is long possible already.
        
         | noman-land wrote:
         | In a few years you'll point the AI at your life and it will
         | live it for you.
        
           | ben_w wrote:
           | In the Futurama sense?
           | 
           | https://youtu.be/LCPhbN1l024?si=Lsf9Uuors1Dl56Yy
        
           | blooalien wrote:
           | Great! That will free me up to view advertisements 24/7! Win-
           | win!
        
         | shaan7 wrote:
         | I'm more excited for the day we can have robot do that for the
         | analog stuff instead. I'd rather sit and enjoy organizing files
         | on my computer than loading the washing machine, the
         | dishwasher, folding clothes, packing for vacations .. well you
         | get the point.
         | 
         | Would be cool if I can build CI pipelines for daily stuff, just
         | describe everything in YAML* and not have to do repetitive
         | tasks all the time.
         | 
         | * or, hopefully something better that isn't a pain to write
        
       | outside1234 wrote:
       | glue + tomatoes = recipe directory
        
       | behnamoh wrote:
       | Apple will probably add this feature and more to the stacks
       | feature on macOS (a multimodal model would be very useful there).
       | Even better: I expect Apple to use ML and local models to scan
       | file contents and have them show up in search (e.g., on spotlight
       | or Raycast, search for the picture of my latest receipt that I
       | saved __somewhere__ I don't remember).
        
         | verdverm wrote:
         | A few months ago, while using search in finder, I noticed that
         | it would return images with the search term in the image. They
         | seem to be doing something ML already
        
           | iterateoften wrote:
           | It's been like that for years. It does OCR of text, object
           | names (like cat) and groups by person / pet, etc.
           | 
           | Most stuff like that doesn't need an LLM and would probably
           | decrease utility.
        
         | ravetcofx wrote:
         | They Have stacks already which kind of do a similar thing
         | https://support.apple.com/en-ca/guide/mac-help/mh35846/14.0/...
        
       | idncsk123 wrote:
       | Working on something related, github.com/idncsk/canvas, canvas-
       | server, canvas-ui-browser, always use the dev or for browser the
       | tarbor branch, waiting on my burger in Budapest sry for the lack
       | of details
        
         | jjuliano wrote:
         | That's an interesting project and concept.
        
       | jjuliano wrote:
       | I build a tool with similar goal a year ago:
       | https://github.com/jjuliano/aifiles -- A CLI that manages files
       | using AI. It helps me to organize my files and backups.
       | 
       | In my case, I built a local OpenAI-emulated proxy API to run
       | against a local LLM, and used a modified OpenAI library to
       | connect with it. This was the solution a year ago. Now, it's
       | easier to deploy a local LLM.
        
       | pratik_kanthi wrote:
       | I would like to see something similar for my browser tabs which
       | are always a mess. Unsure what UX considerations are needed.
       | Thoughts?
        
         | philippgerard wrote:
         | Arc Browser does that already, also for downloaded files. Looks
         | neat.
        
           | cpuguy83 wrote:
           | +1 for Arc
        
         | ilaksh wrote:
         | chrome recently rolled out something that groups browser tabs
         | together. I thought they said they used AI. But basically a
         | bunch of youtube tabs get consolidated into a youtube button
         | tab that toggles the group to expand.
        
           | pratik_kanthi wrote:
           | Makes sense. Wouldn't grouping based on the context of what
           | i'm browsing be more desirable? If i'm searching a bug fix
           | i'll probably be doing it across multiple domains, perhaps a
           | tab group based on that.
        
         | qwertox wrote:
         | Also with the possibility to look at the current tab and open a
         | new window populated with tabs from related pages you've
         | visited earlier.
        
         | pama wrote:
         | The high level strategy would be to get the browser tab
         | contents and then ask a local LLM to organize the tabs based on
         | their full content. If you run selenium or some developer
         | versions of certain browsers you might be able to source the
         | full contents directly, including any state that may not be
         | obvious from only the URL. If the url of the tabs is enough
         | (for most cases it should be), then there are many options and
         | relatively easy implementations possible. Emacs has tools to
         | communicate with browsers (though they depend on the local OS
         | and some are limited to only certain browsers or certain OSs),
         | so if you are happy with controlling the tabs from Emacs, you
         | could simply reorganize/regroup the tabs within an Emacs buffer
         | with the help of an LLM that gets the url and may open up
         | connections to see what the trivially accessible contents aee.
         | I would use this and might test this idea when I am not AFK. If
         | Emacs is not an option, perhaps find an OS or extension-
         | dependent way to reorganize the tabs.
        
           | pratik_kanthi wrote:
           | an extension seems the most natural way to do this, but that
           | would entail hosting a model which isn't cheap, will give it
           | a go
        
             | IncreasePosts wrote:
             | Can extensions connect to localhost? IE to a local ollama
             | for example.
        
       | delijati wrote:
       | Can i run this in a sandbox or dry ... Not that i'm not trusting
       | my AI Overlord ;)
        
       | haolez wrote:
       | Will this actually move stuff around? I'd prefer that it mounted
       | in another directory, giving me an organized view of my files but
       | not actually moving them.
        
         | mihalycsaba wrote:
         | Exactly this. If it starts moving files around, who knows where
         | will they end up.
        
         | bossyTeacher wrote:
         | Hopefully not. LLMs hallucinate. It will move some critical
         | files somewhere random n levels deep in some node folder you
         | will never be able to find
        
           | outofpaper wrote:
           | It's perfect for organizing people's /tmp folder
        
           | brulard wrote:
           | /dev/null seems like a nice destination
        
         | nestorD wrote:
         | A `dry run` mode producing a symbolic link based reorganized
         | copy of the folder would be nice.
        
           | pawelduda wrote:
           | Hmm or just make it produce symbolic links by default, and
           | then if you want, allow you to "commit" changes, which would
           | actually move these files. Any downsides to this?
           | 
           | Could also have integrity checks that total number of files
           | and their attributes didn't change after the commit
           | 
           | Cool project OP
        
         | spot5010 wrote:
         | Yeah, the thought of some LLM based hallucinating piece of
         | software moving my files around gives me anxiety.
        
       | accrual wrote:
       | This is pretty neat. Can confirm my Downloads folder could use
       | some help, there's usually at least one or two nested "Old
       | Downloads" or "Sort me" folders.
       | 
       | I think one thing to improve the readme or landing page for this
       | project would be a before & after for a sample ~/Downloads
       | directory, maybe in `tree` format.
        
       | vessenes wrote:
       | So, this terrifies me, in that it is going to rename files and
       | move them. I have an inordinate amount of fear about that
       | happening in the hands of an LLM. But, I love the idea of a
       | virtual filesystem / directory that lets me see things based on
       | LLM-led naming. Just, leave my main files alone, oh please god,
       | don't touch them.
       | 
       | With virtual systems, I think it could be really interesting, you
       | could have a few different types, from conceptual, to project, to
       | research area. That would be amazingly cool.
        
       | nicklecompte wrote:
       | A lot of people are worried about Llama screwing up, and that's a
       | valid concern. But this is also an Electron app + a few
       | nontrivial Python scripts for watching changes to a filesystem,
       | yet there are _zero_ actual tests. Just some highly
       | unrepresentative  "sample data."
       | 
       | I am a grumpy AI hater. But Llama is not the security/data risk
       | here. I don't think anyone should use this unless they are
       | interested in contributing.
        
         | outofpaper wrote:
         | Oh come now no need to be grumpy. We need to just accept that
         | this is somewhere between managing your files using an
         | algorithm that integrates a roulette wheel and a system that
         | instead has Russian roulette built in. In either case its going
         | to get messy.
        
       | endofreach wrote:
       | Interesting. Literally just this week i planned some time in to
       | experiment with LLMs as a full FS. This seems not truly fair to
       | be named "FS". But cool approach. Though this has been done in
       | bash scripts online before. So i only dislike naming it FS. It's
       | not an FS. Just like your productivity app is not an OS. Will
       | check it out regardless. Congratulations anyway!
        
       | gregwebs wrote:
       | I think I would like a tool that intelligently suggested renames
       | but doesn't automatically do them.
       | 
       | Arc Browser has an AI rename feature (for downloaded files). I
       | tried it out but I had to turn it off. I love Arc Browser BTW and
       | their AI hover summary is useful. I found poorly naming of files
       | to be disruptive and it's a lot better if I am more involved in
       | renaming the file- that will help me remember it.
        
         | olooney wrote:
         | That's the approach I took on a similar toy project[IT] I've
         | been working on the past week (images instead of text.) It
         | first creates a `metadata.csv` file with suggested clean
         | filenames and a Boolean flag indicating if it thinks it needs
         | to be changed at all. You can manually view and edit the
         | `metadata.csv` file and only once you're happy with it do you
         | pull the trigger by running `autorename()`. I definitely feel
         | like you need a human in the loop for this kind of thing.
         | 
         | [IT]: https://github.com/olooney/image_tagger
        
       | cyanydeez wrote:
       | This the type of thing I plan to implement. If there's one way to
       | save things. Of course, it's definitely going to hide some files
       | eventually, so hopefully you got a log file going documenting the
       | modifications.
        
       | caseyy wrote:
       | I've written something similar in python -- a script that renames
       | documents based on their contents using Ollama. It's very easy to
       | write things like that, I recommend anyone interested in local
       | LLMs try a fun project such as that.
       | 
       | My first question was why would this be a file system rather than
       | an app or a script, but I see it's actually an Electron app and
       | python scripts, which I think is the right approach.
       | 
       | I think that something that would have a UI for automatically
       | tagging, renaming, and moving files _on request_ but not
       | constantly would be very handy. Also, if you could somehow steer
       | files into binning directories ( "if X, put it in bin ~/X, if Y,
       | put the file in bin ~/Y", "if it's an invoice or deals with
       | payment, put it in ~/Documents/Finance"), that would be cool.
       | Finally, Windows support would be amazing.
        
         | TylerE wrote:
         | Well, I see very good reasons for writing it as an actual file
         | system... it allows the rest of the ecosystem to interact with
         | it on an equal footing.
        
         | btbuildem wrote:
         | > automatically tagging, renaming, and moving files on request
         | but not constantly
         | 
         | Key point here -- I might be old, but I like my files to stay
         | where/how I left them.
        
         | 3abiton wrote:
         | I feel naming this is rather personal as we all think
         | differently, and organize files mirroring our way of thinking.
         | I find it odd to have an LLM do that. How often do you use it
         | and how does it fit withing your workflow?
        
       | saintradon wrote:
       | This is something where I can see infinite context windows really
       | working. My local llm knowing where tens of thousands of files
       | located on my computer are, what they do, which ones can be
       | moved, which ones can't etc, how to organize them. Just cleaning
       | up my absolutely messy space.
        
       | varenc wrote:
       | What makes this a "file system" and not just an application that
       | uses llama/AI to do file renaming? I was expecting something like
       | a FUSE application that offered an alternative AI-organized view
       | into a directory.
        
       ___________________________________________________________________
       (page generated 2024-05-26 23:01 UTC)