linkhut

08 Jan 26

Create, debug & maintain dlt pipelines in production - dltHub Workspace

Go from writing pipeline code to ingesting data and delivering reports via Notebooks - all in one flow. Discover over 10,100 REST API data sources today.

The dltHub Workspace is a dedicated environment for developers to create, debug, and maintain dlt pipelines. As the first workflow, we’ve launched LLM-native pipeline development for over 10,100 REST API data sources in the Workspace.

In this workflow you can generate the LLMs context for your source, can go from writing pipeline code to ingesting data and delivering reports via Notebooks - all in one flow, with outputs tailored to data users.

by tmfnk 27 days ago

Tags:

28 Dec 25

langgenius/dify: Production-ready platform for agentic workflow development.

https://github.com/langgenius/dify/

Production-ready platform for agentic workflow development. - langgenius/dify

by tmfnk 1 month ago

Tags:

OpenDCAI/DataFlow: Easy Data Preparation with latest LLMs-based Operators and Pipelines.

https://github.com/OpenDCAI/DataFlow

Easy Data Preparation with latest LLMs-based Operators and Pipelines. - OpenDCAI/DataFlow

by tmfnk 1 month ago

Tags:

karpathy/hn-time-capsule: Analyzing Hacker News discussions from a decade ago in hindsight with LLMs

https://github.com/karpathy/hn-time-capsule

Analyzing Hacker News discussions from a decade ago in hindsight with LLMs - karpathy/hn-time-capsule

A Hacker News time capsule project that pulls the HN frontpage from exactly 10 years ago, analyzes articles and discussions using an LLM to evaluate prescience with the benefit of hindsight, and generates an HTML report.

by tmfnk 1 month ago

Tags:

arben-adm/mcp-sequential-thinking

https://github.com/arben-adm/mcp-sequential-thinking

A Model Context Protocol (MCP) server that facilitates structured, progressive thinking through defined stages. This tool helps break down complex problems into sequential thoughts, track the progression of your thinking process, and generate summaries.

by tmfnk 1 month ago

Tags:

Auto-Grading Ten Years of Earnings Calls for Prescience and Delusion | knowtrend.ai

https://knowtrend.ai/blog/hindsight-analysis

Using LLMs to identify extraordinary prescience, confident delusions, and the madness of earnings calls. Inspired by Andrej Karpathy

Auto-Grading Ten Years of Earnings Calls for Prescience and Delusion

by tmfnk 1 month ago

Tags:

Earnings Call Civilizational Score Prompt

https://gist.github.com/yevman/02dce4f9c8da4b3d9d757399d86c940b

Earnings Call Civilizational Score Prompt. GitHub Gist: instantly share code, notes, and snippets.

You are a financial historian and industry expert conducting a review of past earnings calls with the benefit of hindsight and wisdom.

by tmfnk 1 month ago

Tags:

17 Dec 25

Useful patterns for building HTML tools

https://simonwillison.net/2025/Dec/10/html-tools/

What can you build with one HTML file and an LLM? A lot more than you think. This post shows hard-won patterns from 150+ real tools, showing how to skip frameworks, exploit browser primitives, and build projects fast with copy/pasteable code.

by tmfnk 1 month ago saved 2 times

Tags:

13 Dec 25

orneryd/NornicDB: NornicDB is a high-performance graph database designed for AI agents and knowledge systems.

https://github.com/orneryd/NornicDB

NornicDB is a high-performance graph database designed for AI agents and knowledge systems. It speaks Neo4j’s language (Bolt protocol Cypher) so you can switch with zero code changes, while adding intelligent features including GPU accelerated embedding search, k-means, and auto TLP with optional LLM inference, plus plugins. - orneryd/NornicDB

by tmfnk 1 month ago

Tags:

12 Dec 25

simstudioai/sim: Open-source platform to build and deploy AI agent workflows.

https://github.com/simstudioai/sim

Open-source platform to build and deploy AI agent workflows. - simstudioai/sim

Design agent workflows visually on a canvas—connect agents, tools, and blocks, then run them instantly.

by tmfnk 1 month ago saved 2 times

Tags:

01 Dec 25

Hugo-Dz/spritefusion-pixel-snapper: A tool to snap pixels to a perfect grid. Designed to fix messy and inconsistent pixel art generated by AI.

https://github.com/Hugo-Dz/spritefusion-pixel-snapper

A tool to snap pixels to a perfect grid. Designed to fix messy and inconsistent pixel art generated by AI. - Hugo-Dz/spritefusion-pixel-snapper

by tmfnk 2 months ago

Tags:

Writing a good CLAUDE.md | HumanLayer Blog

https://www.humanlayer.dev/blog/writing-a-good-claude-md

CLAUDE.md is a high-leverage configuration point for Claude Code.

Learning how to write a good CLAUDE.md (or AGENTS.md) is a key skill for agent-enabled software engineering.

An LLM will perform better on a task when its’ context window is full of focused, relevant context

by tmfnk 2 months ago

Tags:

30 Nov 25

Yandori news flow

https://yandori.io/news-flow/

a system that monitors ~200,000 news RSS feeds in near real-time and clusters related articles to show how stories spread across the web. It uses Snowflake’s Arctic model for embeddings and HNSW for fast similarity search. Each “story cluster” shows who published first, how fast it propagated, and how the narrative evolved as more outlets picked it up.

by tmfnk 2 months ago saved 2 times

Tags:

29 Nov 25

LLM Course

https://github.com/mlabonne/llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

mlabonne.github.io/blog/

by tmfnk 2 months ago saved 3 times

Tags:

27 Nov 25

CHATS-lab/verbalized-sampling: Verbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities.

https://github.com/CHATS-lab/verbalized-sampling

Verbalized Sampling, a training-free prompting strategy to mitigate mode collapse in LLMs by requesting responses with probabilities. Achieves 2-3x diversity improvement while maintaining quality. Model-agnostic framework with CLI/API for creative writing, synthetic data generation, and dialogue simulation. - CHATS-lab/verbalized-sampling

by tmfnk 2 months ago

Tags:

GibsonAI/Memori: Open-Source Memory Engine for LLMs, AI Agents & Multi-Agent Systems

https://github.com/GibsonAI/memori?tab=readme-ov-file#examples

Open-Source Memory Engine for LLMs, AI Agents

What is Memori Memori enables any LLM to remember conversations, learn from interactions, and maintain context across sessions with a single line: memori.enable(). Memory is stored in standard SQL databases (SQLite, PostgreSQL, MySQL) that you fully own and control.

Why Memori?

One-line integration - Works with OpenAI, Anthropic, LiteLLM, LangChain, and any LLM framework SQL-native storage - Portable, queryable, and auditable memory in databases you control 80-90% cost savings - No expensive vector databases required Zero vendor lock-in - Export your memory as SQLite and move anywhere Intelligent memory - Automatic entity extraction, relationship mapping, and context prioritization

by tmfnk 2 months ago

Tags:

microsoft/fara

https://github.com/microsoft/fara

Fara-7B is Microsoft’s first agentic small language model (SLM) designed specifically for computer use.

With only 7 billion parameters, Fara-7B is an ultra-compact Computer Use Agent (CUA) that achieves state-of-the-art performance within its size class and is competitive with larger, more resource-intensive agentic systems.

by tmfnk 2 months ago

Tags:

lotus-data/lotus: AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings.

https://github.com/lotus-data/lotus

AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing, that’s as simple as writing Pandas code - lotus-data/lotus

LOTUS is an open-source query engine that makes programming as easy as writing Pandas and optimizes your programs for up to 400x speedups.

by tmfnk 2 months ago

Tags:

DeepScholar

https://deep-scholar.vercel.app/

Research assistant powered by Lotus.

DeepScholar, an openly-accessible DeepResearch system from Berkeley & Stanford.

DeepScholar efficiently processes 100s of articles, demonstrating strong long-form research synthesis capabilities, competitive with OpenAI’s DR, while running up to 2x faster!

by tmfnk 2 months ago

Tags:

25 Nov 25

OCR Arena

https://www.ocrarena.ai/battle

OCR Arena is a free playground for testing and evaluating leading foundation VLMs and open source OCR models side-by-side. Upload a document, measure accuracy, and vote for the best models on a public leaderboard.

by tmfnk 2 months ago