Post At38CSMLJxbXhIBHdY by mala@fosstodon.org
Post #At38CC3fvbAJ8lK9Jo by mala@fosstodon.org
       
       I got back home from the amazing #SocialWebFOSDEM and while I am drafting a “Build Your Own Timeline Algorithm” blog post to follow up on my talk I decided to keep the momentum going with a thread about it. You'll find my code at https://github.com/aittalam/byota and the talk slides here: https://fosdem.org/2025/schedule/event/fosdem-2025-5601-build-your-own-timeline-algorithm/ (a video with the recording of the talk should be available soon too). Now let’s dig into #BYOTA!
       
Post #At38CCxgZLy5wTWsca by mala@fosstodon.org
       
What is BYOTA? Let’s start from what it is not: it’s not a new algorithm (you’ll see a few, but all coming from very classical ML), an end-user application, or a service. To me, BYOTA is a call to arms, starting with an imperative: BUILD. I like to see it as a playground where people can easily play with, create, and share their own algorithms. My main contribution here is just taking a few relatively recent and absolutely amazing tools and putting them together. Where does this idea come from?
       
Post #At38CDnRSvMuWzkDIG by mala@fosstodon.org
       
I like to say it all started with a sink, but for me it was about half a year before that. I was already fed up with what was happening around all social media and was looking for non-centralized alternatives. I joined the Fediverse and found some great communities of practice that resonated a lot with me. Each community had different characteristics: language, message frequency, number of participants. This is where I started to think about the idea of a “personal algorithm”.
       
Post #At38CEWokDfGniyS1I by mala@fosstodon.org
       
But let us start with the TL algorithms we all know: they are usually not monolithic, but made of smaller, more specific algorithms which together decide whether and how posts appear on your timeline. If you are interested in this topic and want to delve deeper, let me point you to this excellent blog post (part of an excellent series!) by Luke Thorburn: https://medium.com/understanding-recommenders/how-platform-recommenders-work-15e260d9a15a
       
Post #At38CFVR6q9bpjKrVQ by mala@fosstodon.org
       
For algorithms specifically running on the Fediverse, let me point you to the following, in no particular order:
- https://searchtodon.social/
- https://github.com/hodgesmr/mastodon_digest
- https://www.tootfinder.ch/
- http://fediview.com/
- https://github.com/pkreissel/foryoufeed
I am grateful I could stand on the shoulders of these giants, learning from them about both the advantages and the limitations of running algorithms on the Fediverse.
       
Post #At38CGKU32zGO3Dd4a by mala@fosstodon.org
       
Every algorithm I saw was affected by one or more of the following problems: bias; lack of privacy, transparency, and user control; dependency on ML algorithms which are complex, computationally heavy, and require centralized execution. Note that reverse chronological itself (which many welcome as the absence of an algorithm) is an algorithm, and it is biased: it generally favors statuses by people who write more and are in the same timezone as me.
       
Post #At38CHBerLWP2y65xI by mala@fosstodon.org
       
In practice, those are actual problems only when the objective is to retain users on your platform. However, we can break this assumption and decide we don’t need to compete with commercial algorithms: serving people’s purposes is more than enough. Once you do, you immediately realize there’s a solution to each of the above problems, one that relies on open, small, interpretable models that can run locally for ad-hoc tasks without requiring much ML knowledge: local embeddings.
       
Post #At38CHvO7K6LKnUcEa by mala@fosstodon.org
       
BYOTA is built on this assumption: do something useful while preserving user privacy and control, using open models and local-first technologies. While the concept sounds straightforward, developing something in my spare time that is easy to use and customize, computationally light, and useful was not trivial. Luckily, over the last year I found tools that are just right for the task, and now BYOTA exists thanks to three main components: Mastodon.py, llamafile, and marimo.
       
Post #At38CJaXvmcOUjl01g by mala@fosstodon.org
       
Mastodon.py is a Python client library for Mastodon, and I use it to download my own posts and my instance’s timeline updates. Since all we are interested in are the status contents, you can swap this library with whatever works with your setup: a generic ActivityPub client, a client for a different social network that is smart enough to expose an API, an RSS feed reader… even a mix of these, if you want to play with cross-platform search and recommendations. https://mastodonpy.readthedocs.io
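A minimal sketch of what this fetching step can look like (the timeline calls are real Mastodon.py methods, but the overall structure and the `strip_html` helper are illustrative, not BYOTA’s actual code):

```python
import re


def strip_html(content: str) -> str:
    """Reduce a status' HTML content to plain text (rough approximation)."""
    text = re.sub(r"<[^>]+>", " ", content)
    return re.sub(r"\s+", " ", text).strip()


def fetch_statuses(api_base_url: str, access_token: str, limit: int = 40) -> dict:
    """Download a few timelines and keep only their plain-text status contents."""
    # Imported lazily so the text helper above works without Mastodon.py installed.
    from mastodon import Mastodon

    client = Mastodon(api_base_url=api_base_url, access_token=access_token)
    timelines = {
        "home": client.timeline_home(limit=limit),
        "local": client.timeline_local(limit=limit),
        "public": client.timeline_public(limit=limit),
        "tag/gopher": client.timeline_hashtag("gopher", limit=limit),
    }
    return {
        name: [strip_html(status["content"]) for status in statuses]
        for name, statuses in timelines.items()
    }

# Usage (you provide your own instance URL and access token):
# posts = fetch_statuses("https://fosstodon.org", access_token="...")
```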
       
Post #At38CKMl2XBOuGJVAm by mala@fosstodon.org
       
#Llamafile is a Mozilla tool that packages a language model in a single executable file that runs on most platforms. It is 100% local and has been optimized to run on slower hardware, from Raspberry Pis to my 8-year-old laptop. It is based on llama.cpp, which supports a plethora of models, not just LLMs: I chose all-MiniLM because it’s tiny (50MB) and has open code, research papers, and datasets. https://github.com/Mozilla-Ocho/llamafile https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2
       
Post #At38CLHTdeYLkAqna4 by mala@fosstodon.org
       
In #BYOTA, we use #llamafile to calculate *sentence embeddings*. If you don’t know what embeddings are, just think of them as numerical descriptors of your Mastodon statuses, which are closer (as two cities’ coordinates are close on a map) the more semantically similar their respective statuses are. We’ll get back to this later with a more visual description. If you are interested in embeddings and want to delve deeper, see https://vickiboykis.com/what_are_embeddings/ by @vicki.
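For the curious, here is a sketch of how you can ask a local llamafile server for an embedding over HTTP, plus the cosine similarity used to compare two embeddings. The endpoint and field names follow llama.cpp’s server API and may differ across llamafile versions, so treat them as an assumption to verify against your setup:

```python
import json
import math
import urllib.request


def embed(text: str, base_url: str = "http://localhost:8080") -> list[float]:
    """Request a sentence embedding from a llamafile server running locally.

    Assumes llama.cpp's /embedding endpoint with a {"content": ...} payload.
    """
    req = urllib.request.Request(
        f"{base_url}/embedding",
        data=json.dumps({"content": text}).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["embedding"]


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """~1.0 = semantically close, ~0.0 = unrelated (for normalized embeddings)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm
```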
       
Post #At38CMJzlmA4yH2K92 by mala@fosstodon.org
       
Marimo is a reactive notebook for Python that is also shareable as an application. It mixes code, markdown, and UI widgets in the same interface, so you can (1) develop as you would in other notebook environments, (2) share your notebook as an application, hiding the code and displaying only the UI components, and (3) let people use it as they want, customizing both code and interface within a grid-based UI. Above all, marimo relies on WASM to run everything inside one’s browser.
       
Post #At38CN9OgfHJXh5NGS by mala@fosstodon.org
       
What does having a WASM-powered notebook mean? Consider #BYOTA: you can download it from its repo, pip-install its dependencies, and run it locally like any Python notebook. But you can also deploy it as HTML+JavaScript files, host it somewhere super cheap (because the server won’t run any of your code), and people will be able to run it in their browser with no need to install anything else! Plus, this works both with “my” algorithm and with whatever alternative you might develop starting from BYOTA’s code.
       
Post #At38CO0DWHWsBVnYau by mala@fosstodon.org
       
So, what can you do with #BYOTA? The first thing is embeddings visualization. In these pictures you can see a 2D plot of embeddings calculated on four different timelines: home (blue, only people I follow), local (orange, all posts from my instance, which is fosstodon.org), public (red, federated posts from people followed by users of my instance), and the timeline I got by searching for the #gopher hashtag (light blue).
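To draw a 2D plot like this, the high-dimensional embeddings have to be projected down to two dimensions. Here is a sketch using PCA via SVD, one possible choice of projection (not necessarily the one the notebook uses):

```python
import numpy as np


def project_2d(embeddings: np.ndarray) -> np.ndarray:
    """Project an (n_statuses, dim) embedding matrix onto its top 2 principal axes."""
    centered = embeddings - embeddings.mean(axis=0)
    # SVD gives the principal directions as rows of vt; keep the first two.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:2].T

# points = project_2d(all_embeddings)  # then scatter-plot, colored by timeline
```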
       
Post #At38CP8lHzxTiIntYG by mala@fosstodon.org
       
You can also plot all the embeddings together! In this picture I have selected, analyzed, and annotated a few areas of the map. What I find cool is that semantically similar statuses will always be close to each other regardless of which timeline they appeared in, so you can follow content across different timelines. This is simple to grasp and easy to interpret, as it is based on the actual text of the statuses. And this way of encoding semantic similarity allows us to do many other things.
       
Post #At38CQnD95uMq2jiEq by mala@fosstodon.org
       
Semantic search is one of them! If you provide a status ID (which you can get by selecting a post on the map), you can look for the statuses which are most similar, that is closest, to it (the default is the top 5). You can also just make up a sentence that describes what you are interested in (e.g. the figure shows the statuses closest to “I am a fan of free software and everything opensource”). Note that the results come from different timelines, in this case public, local, and tag/gopher.
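Under the hood, this kind of search can be as simple as a nearest-neighbor lookup in embedding space. A sketch, assuming the embeddings are stored as a numpy matrix with one row per status:

```python
import numpy as np


def top_k_similar(query: np.ndarray, embeddings: np.ndarray, k: int = 5) -> np.ndarray:
    """Indices of the k statuses whose embeddings are closest to the query.

    Both the query and each row of `embeddings` are normalized so the dot
    product equals cosine similarity.
    """
    q = query / np.linalg.norm(query)
    e = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    scores = e @ q
    return np.argsort(scores)[::-1][:k]  # highest similarity first
```

The query embedding can come from an existing status (search by ID) or from a freshly embedded made-up sentence; the lookup is the same either way.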
       
Post #At38CRWwP4UJ7s8EW8 by mala@fosstodon.org
       
How could we have a timeline algorithm without post re-ranking? As you can see in the figures, relying on sentence embeddings gives you content-based re-ranking of a timeline. You basically apply the “style” of one set of statuses to another list, putting on top the statuses which are overall closest to the ones you provided. For privacy reasons I did not show a re-ranked public timeline: instead, I re-ranked my own messages according to the “style” of #gopher and #bookstodon.
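A sketch of this kind of re-ranking, again assuming numpy matrices of embeddings. Scoring each candidate by its mean cosine similarity to the style set is one reasonable choice, not necessarily the exact one in the notebook:

```python
import numpy as np


def rerank(candidates: np.ndarray, style: np.ndarray) -> np.ndarray:
    """Order candidate statuses by closeness to a 'style' set of statuses.

    Returns indices into `candidates`, best match first.
    """
    c = candidates / np.linalg.norm(candidates, axis=1, keepdims=True)
    s = style / np.linalg.norm(style, axis=1, keepdims=True)
    scores = (c @ s.T).mean(axis=1)  # mean cosine similarity to the style set
    return np.argsort(scores)[::-1]

# order = rerank(my_statuses_emb, gopher_statuses_emb)
```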
       
Post #At38CSMLJxbXhIBHdY by mala@fosstodon.org
       
“This looks nice, but how heavy is the calculation of these embeddings?” I embedded 800 statuses using four different models, from the 22M-parameter all-MiniLM to the 7B e5-mistral. I tested them with two different local servers, llamafile and ollama, and two different laptops, my 2016 MacBook Pro with an Intel CPU and my 2024 one with an M3 Max. The results are shown in the picture. The summary: it can work on older hardware, and on recent hardware you will barely notice the calculation overhead.
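The benchmark itself is nothing fancy; a sketch of the timing harness (here `embed_fn` stands for whatever client you use to call llamafile or ollama, so the function name is illustrative):

```python
import time


def time_embeddings(texts, embed_fn):
    """Embed a batch of statuses and report the elapsed wall-clock time."""
    start = time.perf_counter()
    vectors = [embed_fn(text) for text in texts]
    return vectors, time.perf_counter() - start

# vectors, seconds = time_embeddings(statuses, embed_fn)
```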
       
Post #At38CSvnC9xdTElatM by mala@fosstodon.org
       
All-MiniLM took 11 seconds on the M3, 52 seconds on my 8-year-old laptop. And its embeddings are already good! Larger models might provide extra perks (which I have not investigated yet), but at the price of more compute. Interestingly, despite the fact that both ollama and llamafile are based on llama.cpp, ollama seems to be faster on newer hardware and smaller models, while llamafile becomes a better choice for larger models on older hardware. I used default parameters for both servers, so there is room for improvement.
       
Post #At38CTQHMoLazn1wPY by mala@fosstodon.org
       
“What data do I have to share to run this?” The answer is: none. The embedding servers run completely offline. As for the marimo notebook, the only remote connections you’ll make are those to the marimo and WASM dependencies at bootstrap time, and those required to download posts from your Mastodon instance. You are always in control of which and how many messages you download before running the algorithm. From then on (embeddings, plots, search, re-ranking) everything runs on your device.
       
Post #At38CTz1He8WjXHgYq by mala@fosstodon.org
       
This was #BYOTA, thanks for following this long thread! What you see here are some of the next steps I have planned: in addition to natural ML extensions, I’d like to see BYOTA grow as a tool for people to experiment with different algorithms and share them easily, and for less tech-savvy people to use with as little friction as possible. For this to happen, I will invest some time in understanding how to bring it to fruition at the protocol level, rather than as a single application. Stay tuned!