08 Jan 26

Go from writing pipeline code to ingesting data and delivering reports via Notebooks - all in one flow. Discover over 10,100 REST API data sources today.

The dltHub Workspace is a dedicated environment for developers to create, debug, and maintain dlt pipelines. As the first workflow, we’ve launched LLM-native pipeline development for over 10,100 REST API data sources in the Workspace.

In this workflow you can generate the LLM context for your source and go from writing pipeline code to ingesting data and delivering reports via Notebooks, all in one flow, with outputs tailored to data users.

by tmfnk 27 days ago

28 Dec 25

Production-ready platform for agentic workflow development. - langgenius/dify

by tmfnk 1 month ago

Easy Data Preparation with latest LLMs-based Operators and Pipelines. - OpenDCAI/DataFlow

by tmfnk 1 month ago

Analyzing Hacker News discussions from a decade ago in hindsight with LLMs - karpathy/hn-time-capsule

A Hacker News time capsule project that pulls the HN frontpage from exactly 10 years ago, analyzes articles and discussions using an LLM to evaluate prescience with the benefit of hindsight, and generates an HTML report.

by tmfnk 1 month ago

A Model Context Protocol (MCP) server that facilitates structured, progressive thinking through defined stages. This tool helps break down complex problems into sequential thoughts, track the progression of your thinking process, and generate summaries.

by tmfnk 1 month ago

Using LLMs to identify extraordinary prescience, confident delusions, and the madness of earnings calls. Inspired by Andrej Karpathy.

Auto-Grading Ten Years of Earnings Calls for Prescience and Delusion

by tmfnk 1 month ago

Earnings Call Civilizational Score Prompt. GitHub Gist: instantly share code, notes, and snippets.

You are a financial historian and industry expert conducting a review of past earnings calls with the benefit of hindsight and wisdom.

by tmfnk 1 month ago

17 Dec 25

What can you build with one HTML file and an LLM? A lot more than you think. This post shares hard-won patterns from 150+ real tools, showing how to skip frameworks, exploit browser primitives, and build projects fast with copy/pasteable code.

by tmfnk 1 month ago saved 2 times

13 Dec 25

NornicDB is a high-performance graph database designed for AI agents and knowledge systems. It speaks Neo4j’s language (Bolt protocol and Cypher) so you can switch with zero code changes, while adding intelligent features including GPU-accelerated embedding search, k-means, and auto TLP with optional LLM inference, plus plugins. - orneryd/NornicDB

by tmfnk 1 month ago

12 Dec 25

Open-source platform to build and deploy AI agent workflows. - simstudioai/sim

Design agent workflows visually on a canvas—connect agents, tools, and blocks, then run them instantly.

by tmfnk 1 month ago saved 2 times

01 Dec 25

A tool to snap pixels to a perfect grid. Designed to fix messy and inconsistent pixel art generated by AI. - Hugo-Dz/spritefusion-pixel-snapper

by tmfnk 2 months ago

CLAUDE.md is a high-leverage configuration point for Claude Code.

Learning how to write a good CLAUDE.md (or AGENTS.md) is a key skill for agent-enabled software engineering.

An LLM will perform better on a task when its context window is full of focused, relevant context.

by tmfnk 2 months ago

30 Nov 25

A system that monitors ~200,000 news RSS feeds in near real-time and clusters related articles to show how stories spread across the web. It uses Snowflake’s Arctic model for embeddings and HNSW for fast similarity search. Each “story cluster” shows who published first, how fast it propagated, and how the narrative evolved as more outlets picked it up.
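
The clustering idea can be sketched in a few lines of Python. This is only an illustration, not the project's code: the embeddings are hand-written toy vectors rather than Arctic model outputs, a brute-force scan stands in for the HNSW index, and the names `cluster_stories` and `threshold` are hypothetical.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def cluster_stories(articles, threshold=0.9):
    """Greedy clustering of (timestamp, outlet, embedding) tuples.

    Articles are processed in time order. Each one joins the first cluster
    whose seed embedding is similar enough, otherwise it starts a new one.
    The linear scan over clusters is an assumption for illustration; a real
    system would query an approximate nearest-neighbour index instead.
    """
    clusters = []
    for ts, outlet, emb in sorted(articles, key=lambda a: a[0]):
        for c in clusters:
            if cosine(c["seed"], emb) >= threshold:
                c["members"].append((ts, outlet))
                break
        else:
            # No cluster was close enough: this article seeds a new story.
            clusters.append({"seed": emb, "members": [(ts, outlet)]})
    return clusters

# Toy data: outlet-a and outlet-c cover the same story, outlet-b a different one.
articles = [
    (3, "outlet-c", [0.9, 0.1, 0.0]),
    (1, "outlet-a", [1.0, 0.0, 0.0]),
    (2, "outlet-b", [0.0, 1.0, 0.0]),
]
clusters = cluster_stories(articles)
for c in clusters:
    first_ts, first_outlet = c["members"][0]
    print(first_outlet, "published first;", len(c["members"]), "article(s)")
```

Because members are appended in timestamp order, the first member of each cluster is the outlet that published first, and the gaps between timestamps give a rough measure of how fast the story propagated.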

by tmfnk 2 months ago saved 2 times

29 Nov 25

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

mlabonne.github.io/blog/

by tmfnk 2 months ago saved 3 times

27 Nov 25

Verbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities. Achieves 2-3x diversity improvement while maintaining quality. Model-agnostic framework with CLI/API for creative writing, synthetic data generation, and dialogue simulation. - CHATS-lab/verbalized-sampling
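
A rough sketch of the idea in Python, under stated assumptions: the prompt wording, the JSON reply format, and the helper names (`build_prompt`, `parse_candidates`, `sample`) are made up for illustration and are not taken from the CHATS-lab package. A hard-coded string stands in for the model reply, so no API call is made.

```python
import json
import random

def build_prompt(task, k=5):
    """Ask for k candidate responses, each with a verbalized probability."""
    return (
        f"{task}\n"
        f"Generate {k} different responses. Return a JSON list of objects "
        'with keys "text" and "probability".'
    )

def parse_candidates(reply):
    """Parse the JSON reply and normalize the probabilities to sum to 1."""
    items = json.loads(reply)
    total = sum(float(item["probability"]) for item in items)
    if total <= 0:
        raise ValueError("probabilities must sum to a positive value")
    return [(item["text"], float(item["probability"]) / total) for item in items]

def sample(candidates, rng=random):
    """Draw one response according to the normalized probabilities."""
    texts, weights = zip(*candidates)
    return rng.choices(texts, weights=weights, k=1)[0]

# Hypothetical model reply; a real run would send build_prompt(...) to an LLM.
fake_reply = json.dumps([
    {"text": "A joke about cats", "probability": 0.2},
    {"text": "A joke about compilers", "probability": 0.1},
    {"text": "A joke about coffee", "probability": 0.1},
])
candidates = parse_candidates(fake_reply)
print(build_prompt("Tell me a joke.", k=3))
print(sample(candidates, random.Random(0)))
```

Sampling from the returned distribution, rather than always taking the single most likely answer, is what lets repeated calls produce more varied outputs.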

by tmfnk 2 months ago

Open-Source Memory Engine for LLMs, AI Agents

What is Memori? Memori enables any LLM to remember conversations, learn from interactions, and maintain context across sessions with a single line: memori.enable(). Memory is stored in standard SQL databases (SQLite, PostgreSQL, MySQL) that you fully own and control.

Why Memori?

One-line integration - Works with OpenAI, Anthropic, LiteLLM, LangChain, and any LLM framework
SQL-native storage - Portable, queryable, and auditable memory in databases you control
80-90% cost savings - No expensive vector databases required
Zero vendor lock-in - Export your memory as SQLite and move anywhere
Intelligent memory - Automatic entity extraction, relationship mapping, and context prioritization
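
To make "SQL-native storage" concrete, here is a minimal sketch of conversation memory kept in SQLite using only Python's standard library. It does not use or reproduce Memori's API: the table layout and the helpers `open_memory`, `remember`, and `recall` are hypothetical, and a plain keyword match stands in for any smarter retrieval.

```python
import sqlite3

def open_memory(path=":memory:"):
    """Open (or create) a SQLite database holding conversation turns."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS memory ("
        "id INTEGER PRIMARY KEY, session TEXT, role TEXT, content TEXT)"
    )
    return conn

def remember(conn, session, role, content):
    """Store one conversation turn."""
    conn.execute(
        "INSERT INTO memory (session, role, content) VALUES (?, ?, ?)",
        (session, role, content),
    )
    conn.commit()

def recall(conn, keyword, limit=5):
    """Return the most recent turns, from any session, that mention keyword."""
    rows = conn.execute(
        "SELECT role, content FROM memory "
        "WHERE content LIKE ? ORDER BY id DESC LIMIT ?",
        (f"%{keyword}%", limit),
    )
    return rows.fetchall()

conn = open_memory()
remember(conn, "session-1", "user", "My favorite database is PostgreSQL")
remember(conn, "session-2", "user", "What is the weather like?")
print(recall(conn, "database"))
```

Because the memory lives in an ordinary table, it can be inspected, exported, or queried with any SQL tool, which is the portability the list above describes.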

by tmfnk 2 months ago

Fara-7B is Microsoft’s first agentic small language model (SLM) designed specifically for computer use.

With only 7 billion parameters, Fara-7B is an ultra-compact Computer Use Agent (CUA) that achieves state-of-the-art performance within its size class and is competitive with larger, more resource-intensive agentic systems.

by tmfnk 2 months ago

AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing that’s as simple as writing Pandas code - lotus-data/lotus

LOTUS is an open-source query engine that makes programming as easy as writing Pandas and optimizes your programs for up to 400x speedups.

by tmfnk 2 months ago

Research assistant powered by Lotus.

DeepScholar, an openly-accessible DeepResearch system from Berkeley & Stanford.

DeepScholar efficiently processes 100s of articles, demonstrating strong long-form research synthesis capabilities, competitive with OpenAI’s DR, while running up to 2x faster!

by tmfnk 2 months ago

25 Nov 25

OCR Arena is a free playground for testing and evaluating leading foundation VLMs and open source OCR models side-by-side. Upload a document, measure accuracy, and vote for the best models on a public leaderboard.

by tmfnk 2 months ago