15 Sep 25

This is really cool. Nushell used to support this but apparently it was a pain to maintain. Maybe one day…

by linkraven 4 months ago saved 5 times

13 Sep 25

A really smart ant overall simple checksum that works on unordered collections

by sebastien 4 months ago saved 2 times

11 Sep 25

A map/reduce workflow for LLMs, with what looks like local caching. To me build systems and data processing pipelines like this one have a big intersection.

by sebastien 5 months ago

08 Sep 25

Like any other map, The Internet map is a scheme displaying objects’ relative position; but unlike real maps (e.g. the map of the Earth) or virtual maps (e.g. the map of Mordor), the objects shown on it are not aligned on a surface. Mathematically speaking, The Internet map is a bi-dimensional presentation of links between websites on the Internet. Every site is a circle on the map, and its size is determined by website traffic, the larger the amount of traffic, the bigger the circle. Users’ switching between websites forms links, and the stronger the link, the closer the websites tend to arrange themselves to each other.

by cos 5 months ago

In plain English, this service looks at which websites link to a particular target website, and then it ranks websites that are popular among those linking websites using a method commonly used in recommendation algorithms.

In technical jargon, it reinterprets the incident edges in the adjacency matrix as sparse high dimensional vector, and uses cosine similarity to find the nearest neighbors nodes within this feature-space.

by cos 5 months ago

This is a write-up about an experiment from a few months ago, in how to find websites that are similar to each other. Website similarity is useful for many things, including discovering new websites to crawl, as well as suggesting similar websites in the Marginalia Search random exploration mode.

by cos 5 months ago


02 Sep 25

The National Grid is the electric power transmission network for Great Britain

by struanr 5 months ago saved 4 times
Tags:

30 Aug 25

These platforms do not exist as a place for art to live and flourish. They exist as a place to flatten art into commodity. There is no context here. Instead there are small armies of lawyers trying to make sure a rich person doesn’t get fined for letting a 13-year old medieval music nerd see the Latin prepositional word for “with”.

by kawcco 5 months ago

21 Aug 25

A unified model for representing syntax trees.

by sebastien 5 months ago

12 Aug 25

What kind of society do we live in, for this to regress so sharply?

by linkraven 6 months ago saved 2 times

03 Aug 25

A tool from Google that uses LLMs to extract structured data. I suppose it’s quite reliable!

by sebastien 6 months ago

01 Aug 25

A funny (even more as it’s true) opinion piece by Cory Doctorow on metadata.

by sebastien 6 months ago saved 3 times

17 Jul 25

Very interesting way to avoid using Python. As a Python-hater myself, I appreciate this greatly.

Considering that ML is a pretty important field, and will continue to be going into the future, this is an option I have to keep my eye on.

by linkraven 6 months ago

08 Jul 25

Lots of cool little interactive articles and visualisations.

by sebastien 7 months ago saved 2 times


03 Jul 25

The Comparative Income Taxation Database consists of yearly data on the top marginal income tax rate for a legal individual, and includes the amount from which the top marginal rate applies, what law governs the taxation, and the source of this information (law or consulted literature). The sample included in the data set consists of 20 countries. The time period covered for each country is 1800 (or independence) to 2010. A country is considered to have adopted a modern income tax system if an independent government levies taxes yearly on comprehensive and directly assessed forms of personal income.

by unseeing 7 months ago
Tags:

The Employment Projections (EP) program publishes the classification systems used to produce the employment projections. In addition, EP publishes selected crosswalks between these classification systems and those from other data sources.

by unseeing 7 months ago

29 May 25

LiveStore is a state management framework based on SQLite and event-sourcing. It’s designed for demanding applications and based on years of research.

by chrisSt 8 months ago saved 2 times

Jazz gives you data without needing a database — plus auth, permissions, files and multiplayer without needing a backend. Do everything right from the frontend and ship better apps, faster.

by chrisSt 8 months ago saved 3 times