15 Sep 25
This is really cool. Nushell used to support this but apparently it was a pain to maintain. Maybe one day…
13 Sep 25
A really smart and overall simple checksum that works on unordered collections.
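A minimal sketch of the general idea, not necessarily the linked method: hash each element on its own, then combine the hashes with a commutative operation so the order of iteration cannot matter. The choice of BLAKE2b, the 64-bit width, and the names `element_hash` and `unordered_checksum` are my own assumptions.

```python
import hashlib

MASK64 = (1 << 64) - 1


def element_hash(item: bytes) -> int:
    """Hash a single element to a 64-bit integer (BLAKE2b is an arbitrary choice)."""
    digest = hashlib.blake2b(item, digest_size=8).digest()
    return int.from_bytes(digest, "big")


def unordered_checksum(items) -> int:
    """Combine per-element hashes by addition modulo 2**64.

    Addition is commutative and associative, so any ordering of the
    same elements gives the same result. Unlike XOR, a duplicated
    element does not cancel itself out.
    """
    total = 0
    for item in items:
        total = (total + element_hash(item)) & MASK64
    return total


# Same elements, different order: the checksums match.
same = unordered_checksum([b"a", b"b", b"c"]) == unordered_checksum([b"c", b"a", b"b"])
```

A plain sum like this is fine for catching accidental differences, but it is not meant to resist someone deliberately crafting collisions.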
11 Sep 25
A map/reduce workflow for LLMs, with what looks like local caching. To me, build systems and data processing pipelines like this one overlap a lot.
08 Sep 25
Like any other map, The Internet map is a scheme displaying objects’ relative position; but unlike real maps (e.g. the map of the Earth) or virtual maps (e.g. the map of Mordor), the objects shown on it are not aligned on a surface. Mathematically speaking, The Internet map is a bi-dimensional presentation of links between websites on the Internet. Every site is a circle on the map, and its size is determined by website traffic, the larger the amount of traffic, the bigger the circle. Users’ switching between websites forms links, and the stronger the link, the closer the websites tend to arrange themselves to each other.
In plain English, this service looks at which websites link to a particular target website, and then it ranks websites that are popular among those linking websites using a method commonly used in recommendation algorithms.
In technical jargon, it reinterprets the incident edges in the adjacency matrix as a sparse high-dimensional vector, and uses cosine similarity to find the nearest neighbor nodes within this feature space.
This is a write-up about an experiment from a few months ago, in how to find websites that are similar to each other. Website similarity is useful for many things, including discovering new websites to crawl, as well as suggesting similar websites in the Marginalia Search random exploration mode.
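To make the jargon concrete, here is a rough sketch under my own assumptions, not the actual Marginalia implementation: each website is represented by the set of sites linking to it (a sparse binary vector), and for binary vectors cosine similarity reduces to the size of the intersection divided by the square root of the product of the set sizes. The names `cosine_similarity`, `most_similar`, and the `inbound` mapping are hypothetical.

```python
import math


def cosine_similarity(a: set, b: set) -> float:
    """Cosine similarity of two binary vectors stored as sets of non-zero indices."""
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))


def most_similar(target: str, inbound: dict, k: int = 3) -> list:
    """Rank other sites by how similar their inbound-link sets are to the target's."""
    target_links = inbound[target]
    scored = [
        (cosine_similarity(target_links, links), site)
        for site, links in inbound.items()
        if site != target
    ]
    # Highest score first; ties broken alphabetically for stable output.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(site, score) for score, site in scored[:k]]


# Made-up data: which sites link to each website.
inbound = {
    "a.example": {"x.example", "y.example", "z.example"},
    "b.example": {"x.example", "y.example"},
    "c.example": {"q.example"},
}
ranking = most_similar("a.example", inbound, k=2)
```

This brute-force version compares the target against every other site, which is only workable for small graphs. At real scale you would presumably need an index or some pruning.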
Some simple hash functions including a reversible one.
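As an illustration of what "reversible" can mean here (this is a generic construction, not necessarily one from the linked page): a 32-bit mixer built only from invertible steps, namely xor-shifts by 16 and a multiplication by an odd constant modulo 2**32. The constant and the names `mix32` and `unmix32` are assumptions for the sketch.

```python
MASK32 = 0xFFFFFFFF
MULT = 0x45D9F3B  # any odd constant is invertible modulo 2**32
MULT_INV = pow(MULT, -1, 1 << 32)  # modular inverse, needs Python 3.8+


def mix32(x: int) -> int:
    """Scramble a 32-bit integer using only invertible steps."""
    x &= MASK32
    x ^= x >> 16
    x = (x * MULT) & MASK32
    x ^= x >> 16
    return x


def unmix32(h: int) -> int:
    """Undo mix32 by applying the inverse of each step in reverse order."""
    h &= MASK32
    h ^= h >> 16  # a 16-bit xor-shift on 32 bits is its own inverse
    h = (h * MULT_INV) & MASK32
    h ^= h >> 16
    return h


round_trip_ok = all(unmix32(mix32(x)) == x for x in (0, 1, 12345, 0xDEADBEEF, MASK32))
```

Because every input maps to a distinct output, a mixer like this never collides on 32-bit keys, which is handy for scrambling sequential IDs. It offers no security, since anyone can run the inverse.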
02 Sep 25
The National Grid is the electric power transmission network for Great Britain
30 Aug 25
These platforms do not exist as a place for art to live and flourish. They exist as a place to flatten art into commodity. There is no context here. Instead there are small armies of lawyers trying to make sure a rich person doesn’t get fined for letting a 13-year-old medieval music nerd see the Latin prepositional word for “with”.
21 Aug 25
A unified model for representing syntax trees.
12 Aug 25
What kind of society do we live in, for this to regress so sharply?
03 Aug 25
A tool from Google that uses LLMs to extract structured data. I suppose it’s quite reliable!
01 Aug 25
A funny (even more so because it’s true) opinion piece by Cory Doctorow on metadata.
17 Jul 25
Very interesting way to avoid using Python. As a Python-hater myself, I appreciate this greatly.
Considering that ML is a pretty important field, and will continue to be so in the future, this is an option I have to keep my eye on.
08 Jul 25
Lots of cool little interactive articles and visualisations.
A UI component to view JSON as a table, that’s nice!
03 Jul 25
The Comparative Income Taxation Database consists of yearly data on the top marginal income tax rate for a legal individual, and includes the amount from which the top marginal rate applies, what law governs the taxation, and the source of this information (law or consulted literature). The sample included in the data set consists of 20 countries. The time period covered for each country is 1800 (or independence) to 2010. A country is considered to have adopted a modern income tax system if an independent government levies taxes yearly on comprehensive and directly assessed forms of personal income.
The Employment Projections (EP) program publishes the classification systems used to produce the employment projections. In addition, EP publishes selected crosswalks between these classification systems and those from other data sources.
29 May 25
LiveStore is a state management framework based on SQLite and event-sourcing. It’s designed for demanding applications and based on years of research.
Jazz gives you data without needing a database — plus auth, permissions, files and multiplayer without needing a backend. Do everything right from the frontend and ship better apps, faster.