Very ML | State-of-the-art Machine Learning News Feed | Infomate

/r/MachineLearning
last post: 4 hours ago

Non-deterministic Vulnerability Detection Benchmark System [P]

Syntactically robust NLI for semantics of imperfectly generated text? [R]

Recommendations for speech annotation tools [D]

Some new updates to Papers with Code [P]

[ECCV 2026] Paper Decision Appeals Discussion [D]

An Update on Matrix Recurrent Units, an Attention Alternative [R]

Data-centric debugging for teams training neural nets [P]

Best current methods for finetuning whisper on domain specific vocabulary? [P]

EMA on LoRA ? [R]

A slightly improved DVD-JEPA demo [P]

I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]

Python packages for particle swarms, genetic algorithms. Scikit-opt maybe? [D]

Studying FLUX in diffusers library was hard, so I built a smaller open-source version [P]

TSAuditor: A time-series auditing framework [P]

Hi Reddit, I posted my Build Your Own LLM workshop to Youtube teaching ML, LLM and math intuition [P]

Towards Data Science
last post: 11 hours ago

Encoding Categorical Data for Outlier Detection

How to Use Claude Code in Your Browser

When RAG Users Ask Vague Questions: Clarify Once, Learn the Default

Neural Networks, Explained for Beginners: Start Here If They’ve Confused You

Tool Calling, Explained: How AI Agents Decide What to Do Next

Reconstructing the Table of Contents a PDF Forgot to Ship, So RAG Can Scope by Section

What Are the Possibilities to Build Date Tables in Self-Service Environments?

7 Crucial Barriers Between Data Teams and Self-Healing Data Architecture

Making a PDF’s Images Searchable for RAG, Without Paying to Read Them All

Materialized Lake Views in Microsoft Fabric: When Your Medallion Fits in a SELECT Statement

Python 3.14 and its New JIT Compiler

Building a Custom GStreamer Plugin for NVIDIA DeepStream

I Tried to Schedule My ETL Pipeline. Here’s What I Didn’t Expect.

Parse Scanned PDFs for RAG with EasyOCR: Free OCR Gives You Words, Not a Document

GPU-Resident Top-K for Agentic RAG: I Built a CUDA Kernel So My Retrieval Step Would Stop Bouncing Off the GPU

Distill.pub
last post: none

The Gradient
last post: 4 months ago

The Reasonable Effectiveness of Virtue Ethics in AI Alignment

TheSequence
last post: 16 hours ago

The Sequence Special #881: The Soccer World Cup of AI Models

The Sequence Radar #880: Last Week in AI: A $60B Cursor Deal, Google's Brain Drain, and Midjourney's Body Scanner

The Sequence Opinion #879: When Tokens Become Balance Sheet Items

The Sequence AI of the Week #878: Inside Google Deepmind's First Real Crack in Next-Token Generation

The Sequence Knowledge #878: Beyond Transformer: What We Learned

The Sequence Radar #877: Last Week in AI: Anthropic Ships, Apple Borrows, Musk Lists, Bezos Builds

The Sequence Opinion: Systems of Record vs. Systems of Action

The Sequence AI of the Week #875: Why Your Language Model Needs a Nap

The Sequence Knowledge #874: Transformers or Not?

The Sequence Radar #873: Last Week in AI: Soccer, S-1s, and Supermodels

The Sequence Opinion #872: The Cake Is a Battlefield: Who Really Controls the AI Stack

The Sequence AI of the Week #871: Inside the Loop with Claude Opus 4.8

The Sequence Knowledge #870: Liquid Models and the Search for a Post-Transformer Architecture

The Sequence Radar #869: Last Week in AI: The Token Becomes the Unit of Account — Opus 4.8, OpenRouter, Cognition, Snowflake, and a papal warning

The Sequence Opinion #868: Recursion Is the New Scaling Law

Synced Review
last post: none

📓 Cool Blogs

ODS.ai Habr
last post: 2 months, 2 weeks ago

Vibe Coding the Honest Way, Chess Edition. 1. e4

Why I Became an IT Volunteer & A Dataset of News on the Contradictions of Modern Society

[Translation] How Codex Works

Natural Language Processing & LLMs Course: New Season

SWE-MERA: A New Dynamic Benchmark for Agentic Code Generation Models

Machine Learning Mastery
last post: 4 days, 15 hours ago

The Roadmap to Mastering AI Agent Evaluation

Building an End-to-End Sentiment Analysis Pipeline with Scikit-LLM

AI Agent Tool Design: What Works and What Doesn’t

Python Concepts Every AI Engineer Must Master

Multi-Label Text Classification with Scikit-LLM

Multimodal Browser AI with Transformers.js for Images and Speech

The Practitioner’s Guide to AgentOps

Building Semantic Search with Transformers.js and Sentence Embeddings

Using Scikit-LLM with Open-Source LLMs

Scikit-LLM vs. Traditional Text Classifiers: When Should You Use an LLM?

The Roadmap for Mastering LLMOps in 2026

Serving Multiple Users at Once: How Continuous Batching Keeps LLM Inference Efficient

Building a Context Pruning Pipeline for Long-Running Agents

The Statistics of Token Selection: Logits, Temperature, and Top-P Walkthrough

Building a Multi-Tool Gemma 4 Agent with Error Recovery

ML in Production
last post: none

Sorta Insightful
last post: 1 month ago

AI Will Not Make Your Job Chill

Why I Signed The Amicus Brief for Anthropic v Department of War

MIT Mystery Hunt 2026

Authentic Imperfection

Lil'Log
last post: none

inFERENCe
last post: 3 months, 3 weeks ago

The Future of Software

Deep Learning is Powerful Because It Makes Hard Things Easy - Reflections 10 Years On

The Spectator
last post: none

The Unofficial Google Data Science Blog
last post: none

Off the Convex Path
last post: none

Jay Alammar
last post: none

Piekniewski's blog
last post: none

fast.ai NLP
last post: none

Sebastian Ruder
last post: none

Andrej Karpathy blog
last post: 4 months, 1 week ago

microgpt

大トロ
last post: none

🔬 Science

Papers With Code
last post: none

Papers With Code
last post: none

Papers With Code
last post: none

💼 University and corporation labs

DeepMind
last post: 6 days, 6 hours ago

Unlocking UK house-building with AI-accelerated planning

Securing the future of AI agents

DiffusionGemma: 4x faster text generation

Investing in multi-agent AI safety research

Fluid, natural voice translation with Gemini 3.5 Live Translate

Introducing Gemma 4 12B: a unified, encoder-free multimodal model

Powering the future of robotics in Europe

Measuring the impact of learning with AI in Sierra Leone and beyond

We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks

Fast-tracking genetic leads to reverse cellular aging

Simulate real-world places with Project Genie and Street View

Introducing Gemini Omni

Introducing Google Antigravity 2.0

Gemini for Science: AI experiments and tools for a new era of discovery

Making it easier to understand how content was created and edited

Google
last post: 5 days, 19 hours ago

From AI potential to agentic reality: Driving the UK’s next chapter

How growing UK midsize businesses are building in the AI era

How Siemens "slices the elephant," advancing agentic workflows for industrial software development

Cloud CISO Perspectives: The 4 lessons that guided AI Threat Defense

Introducing the Open Knowledge Format

Powering the next era of Confidential AI

Claude Fable 5: Available on Google Cloud

Report: GKE Inference Gateway delivers up to 92% faster AI responses

Detecting and containing AI-powered threats with Google Security Operations agents

How to unlock true ROI in software development – a deep dive into the latest DORA research

Modernizing Healthcare: How Alcidion achieved greater stability and performance with AlloyDB

What's new for Managed Service for Apache Spark clusters

How Trustpilot built a real-time architecture for data enrichment using Gemma

The fully-managed Remote MCP Server for AlloyDB is now Generally Available

Cloud CISO Perspectives: How to build an AI-ready security program for the public sector

OpenAI
last post: none

Microsoft
last post: 1 week, 3 days ago

Ire identifies another LOTUSLITE specimen

Data Formulator 0.7: AI-powered data analytics for enterprise data

Extending Human Intelligence Through AI

MagenticLite, MagenticBrain, Fara1.5: An agentic experience optimized for small models

Vega: Zero-knowledge proofs for digital identity in the age of AI

Further Notes on Our Recent Research on AI Delegation and Long-Horizon Reliability

mimalloc: A new, high-performance, scalable memory allocator for the modern era

GridSFM: A new, small foundation model for the electric grid

Advancing AI for materials with MatterSim: experimental synthesis, faster simulation, and multi-task models

SocialReasoning-Bench: Measuring whether AI agents act in users’ best interests

Building realistic electric transmission grid dataset at scale: a pipeline from open dataset

Microsoft at NSDI 2026: Advances in large-scale networked systems

Red-teaming a network of agents: Understanding what breaks when AI agents interact at scale

AutoAdapt: Automated domain adaptation for large language models

Can we AI our way to a more sustainable world?

MIT AI
last post: 3 days, 9 hours ago

A better way to model the behavior of metal alloys

MIT in the media: For the future of tech, "Massachusetts can absolutely lead"

In game theory, generalists sometimes win out over specialists

Could AI tell you where you left your keys?

MIT’s Initiative for New Manufacturing builds momentum

Jinhua Zhao named head of the Department of Urban Studies and Planning

When it comes to predicting people’s preferences, it pays to consider “the power of three”

MIT affiliates win 2026 Hertz Foundation Fellowships

Startup’s nuclear-inspired cooling system could make data centers more sustainable

The consequences of relying on AI for accurate news

The crucial human component in computing and AI

PATH to boost AI training and career opportunities for industry-aligned jobs

NSF renews support for MIT-led AI and physics institute, expanding a new model for discovery

Teaching AI agents to ask better questions by playing “Battleship”

Tod Machover receives George Peabody Medal for contributions to music and technology

Berkeley AI
last post: 1 month, 2 weeks ago

Adaptive Parallel Reasoning: The Next Paradigm in Efficient Inference Scaling

Gradient-based Planning for World Models at Longer Horizons

Identifying Interactions at Scale for LLMs

Information-Driven Design of Imaging Systems

RL without TD learning

What exactly does word2vec learn?

AWS Machine Learning
last post: 9 hours ago

Building pay-per-intelligence for AI agents: How Ampersend uses Amazon Bedrock AgentCore Payments

Embed the world: Multimodal AI for searchable aerial imagery at scale

Running ComfyUI workflows on Amazon SageMaker AI processing jobs

Introducing Web Search on Amazon Bedrock AgentCore

Accelerate campaign workflow with insights from Adobe Marketing Agent for Amazon Quick

Monitor and debug generative AI inference with SageMaker detailed metrics and Insights dashboard on CloudWatch

Amazon Bedrock AgentCore harness is now generally available: Go from idea to production-grade agent in minutes

Amazon SageMaker AI Async Inference now supports inline request payloads

Get back hours every day with autonomous agents in Amazon Quick

Context intelligence for your data and AI agents at scale

New in Amazon Bedrock AgentCore: Build agents with broader knowledge and continuous learning

Safeguard your agentic AI applications with the Amazon Bedrock Guardrails InvokeGuardrailChecks API

Introducing container caching in Amazon SageMaker AI for faster model scaling

Parallelize speculative decoding with P-EAGLE on Amazon SageMaker AI

Introducing Gemma 4 models on Amazon Bedrock

NVIDIA
last post: 14 hours ago

At ISC, JUPITER Shows What Exascale Science Looks Like

NAIRR Science Program Reshapes Scientific Research, Powered by NVIDIA AI Infrastructure

NVIDIA Vera CPU Opens the Way for Agentic Scientific AI at Los Alamos National Laboratory

From Materials Simulation to Experimental Astronomy, New NVIDIA AI Software Unlocks Scientific Discoveries

Eco Wave Power Turns Waves Into Watts With NVIDIA AI Infrastructure and Digital Twins

Hotter Than a Hot Tub: The 45°C Breakthrough to Cool AI’s Biggest Machines

How FERC’s Large-Load Interconnection Actions Help Address Grid Stress, Improve Affordability

At Cannes Lions, NVIDIA Partners Reshape Advertising and Marketing With AI

Sync and Stream: GeForce NOW Connects to Members’ Game Libraries Across Devices

France Advances Europe’s AI Future With NVIDIA Technologies

Hands Free, AIs Forward: NVIDIA XR AI Brings Agents to AR Glasses

Coherent Breaks Ground on Expanded Texas Facility, Scaling AI’s Optical Backbone

Build Your Own Transaction Foundation Model for Financial Intelligence

HPE AI Factory With NVIDIA Expands for the Era of Agents

How to Optimize Transformer-Based Models for Low-Precision Training

Facebook
last post: 3 weeks, 6 days ago

SilverTorch: Index as Model — A New Retrieval Paradigm for Recommendation Systems

Reel Friends: Building Social Discovery that Scales to Billions

Modernizing the Facebook Groups Search to Unlock the Power of Community Knowledge

Capacity Efficiency at Meta: How Unified AI Agents Optimize Performance at Hyperscale

How Meta Used AI to Map Tribal Knowledge in Large-Scale Data Pipelines

KernelEvolve: How Meta’s Ranking Engineer Agent Optimizes AI Infrastructure

Meta Adaptive Ranking Model: Bending the Inference Scaling Curve to Serve LLM-Scale Models for Ads

AI for American-Produced Cement and Concrete

Friend Bubbles: Enhancing Social Discovery on Facebook Reels

Ranking Engineer Agent (REA): The Autonomous AI Agent Accelerating Meta’s Ads Ranking Innovation

Patch Me If You Can: AI Codemods for Secure-by-Default Android Apps

RCCLX: Innovating GPU communications on AMD platforms

The Death of Traditional Testing: Agentic Development Broke a 50-Year-Old Field, JiTTesting Can Revive It

Adapting the Facebook Reels RecSys AI Model Based on User Feedback

DrP: Meta’s Root Cause Analysis Platform at Scale

Uber Engineering
last post: none

neptune.ai
last post: 6 months, 3 weeks ago

We are joining OpenAI

Synthetic Data for LLM Training

What are LLM Embeddings: All you Need to Know

Detecting and Fixing ‘Dead Neurons’ in Foundation Models

Part 2: Instruction Fine-Tuning: Evaluation and Advanced Techniques for Efficient Training

How to Optimize LLM Inference

A Researcher’s Guide to LLM Grounding

Instruction Fine-Tuning: Fundamentals, Architecture Modifications, and Loss Functions

▶️ YouTube

Yannic Kilcher
last post: 3 months, 2 weeks ago

I BUILT A FULLY AUTOMATIC MANSPLAINER

Traditional X-Mas Stream

Traditional Holiday Live Stream

TiDAR: Think in Diffusion, Talk in Autoregression (Paper Analysis)

Titans: Learning to Memorize at Test Time (Paper Analysis)

[Paper Analysis] The Free Transformer (and some Variational Autoencoder stuff)

[Video Response] What Cloudflare's code mode misses about MCP and tool calling

[Paper Analysis] On the Theoretical Limitations of Embedding-Based Retrieval (Warning: Rant)

Henry AI Labs
last post: none

3blue1brown
last post: 6 days, 9 hours ago

100 random chords, how many intersections?

Measuring the entropy of English

What's the perfect encoding? How do you know?

Reinventing Entropy | Compression & Intelligence Part 1

Tie random ends: How many loops?

Covering 10 points, a surprisingly tricky puzzle.

Escher's most mind-bending piece

The subset sum puzzle

Escher's most mathematically interesting piece

Bacteria Grid Puzzle Solution

The most underappreciated formula | Exploring high-dimensional spheres

The lattice bacteria puzzle

Solution to the ladybug clock puzzle

The Hairy Ball Theorem

The ladybug clock puzzle

Two Minute Papers
last post: 11 hours ago

DeepSeek Just Solved AI's Billion Dollar Problem

This is OpenClaw On Steroids

Claude AI Knows More Than It Tells You

NVIDIA's New Free AI - A Gift To All of Us

AI Agents as "Games Masters"? 🎮🔥

DeepMind’s New AI Found A Strange New Way To Think

Meet the AI "Co-Scientist" Changing Everything 🤖🧪 #ai

Claude Opus 4.8: Lying Machine No More

A Second Nobel Prize for AlphaFold? 🧬🏆 #alphafold #deepmind #nobelprize #science #ai

Google's Jeff Dean On Data Center Fires, And The Future Of AI

Feynman vs. Einstein vs. Newton: Who Wins? 🧠🤔 #physics #ai #science #feynman #research

Google DeepMind CEO Likes Hard Questions

Insane AI Breakthroughs With Demis Hassabis

DeepSeek Just Changed How AI Sees Images Forever

NVIDIA’s New AI Is Fast For A Strange Reason

DataFest Video
last post: none

JetBrains Research Seminars
last post: none

Yandex. Computer Science
last post: 2 days, 15 hours ago

Why Multimodal Models Are Fundamental 🤖

Omni-Model: What Kind of Beast Is It?

Borealis: How to Train an Audio LLM for the Price of a MacBook

Better LLM pre-training in NVFP4

How to Safely Roll Out New Versions of Product AI Agents

How We Solve Optimization Problems at Yandex Lavka with Uplift Models

HGRPO: Hierarchical Grouped Reward Policy Optimization for Multi-Turn Conversational Agents

Archive Search: How We Are Moving Toward Mindful Text Recognition

Hacks and Defenses in Automatic Kernel Generation

Real-time video generation: where we are and what comes next

AI Generation of Educational Content and Grading of Students' Open-Ended Answers

How to Evaluate Non-Randomized Experiments Faster and More Reliably?

EMPI Agent: A Framework for Neurodivergent Students

How to Get the Most Out of ML Models When Data Is Too Scarce?

AI Tutor and Methods for Evaluating It

ML Trainings
last post: 1 day, 11 hours ago

Captain's Bridge 21.06.2026: A Hole at OpenAI | The Chinese Buy Out Manus | Turn the Crank for AI

Russia Lags Half a Year Behind in Semiconductors

Capital and Control over Technology

US and Chinese Debt: Comparison and Differences

The IT Department and Frontier Models

Virginia Authorities Rejected a Giant Data Center Under Public Pressure

Britain Wants to Adopt AI, While India Wants to Ban It

Biological Weapons and Code: How to Fool an LLM

Anthropic and the US Government

Valentin Malykh Talks About TikTok Generation

Captain's Bridge 14.06.2026: Claude by Passport | AI Headphones from Yandex | TikTok Science

The Value of Code in the Client's Eyes

Dmitry and Valentin Discuss AI and the Shield-versus-Sword Conflict

How to Tell Nonsense from Non-Nonsense in Psychology

How We Build a Product: Expertise That Is Valued

Primer
last post: 5 months, 3 weeks ago

Taking AI Doom Seriously For 62 Minutes

Simulating a single brain cell

🎧 Podcasts

Lex Fridman AI Podcast
last post: 3 weeks, 3 days ago

#497 – Biggest Mysteries in Physics: Antimatter, Dark Energy & ToE – Don Lincoln

#496 – FFmpeg: The Incredible Technology Behind Video on the Internet

#495 – Vikings, Ragnar, Berserkers, Valhalla & the Warriors of the Viking Age

#494 – Jensen Huang: NVIDIA – The $4 Trillion Company & the AI Revolution

#493 – Jeff Kaplan: World of Warcraft, Overwatch, Blizzard, and Future of Gaming

#492 – Rick Beato: Greatest Guitarists of All Time, History & Future of Music

#491 – OpenClaw: The Viral AI Agent that Broke the Internet – Peter Steinberger

#490 – State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI

#489 – Paul Rosolie: Uncontacted Tribes in the Amazon Jungle

#488 – Infinity, Paradoxes that Broke Mathematics, Gödel Incompleteness & the Multiverse – Joel David Hamkins

#487 – Irving Finkel: Deciphering Secrets of Ancient Civilizations & Flood Myths

#486 – Michael Levin: Hidden Reality of Alien Intelligence & Biological Life

#485 – David Kirtley: Nuclear Fusion, Plasma Physics, and the Future of Energy

#484 – Dan Houser: GTA, Red Dead Redemption, Rockstar, Absurd & Future of Gaming

#483 – Julia Shaw: Criminal Psychology of Murder, Serial Killers, Memory & Sex

Microsoft Research Podcast
last post: 2 months ago

Can we AI our way to a more sustainable world?

Ideas: Steering AI toward the work future we want

Will machines ever be intelligent?

Trailer: The Shape of Things to Come

Ideas: Community building, machine learning, and the future of AI

Ideas: More AI-resilient biosecurity with the Paraphrase Project

NLP Highlights
last post: none

Data Skeptic
last post: 5 days, 9 hours ago

AutoLike

Student Spotlight: Aaron Payne, Data Analyst

The Future is Agentic in Recommender Systems

Book Ratings and Recommendations

Disentanglement and Interpretability in Recommender Systems

Collective Altruism in Recommender Systems

Niche vs Mainstream

Healthy Friction in Job Recommender Systems

Fairness in PCA-Based Recommenders

Video Recommendations in Industry

Eye Tracking in Recommender Systems

Cracking the Cold Start Problem

Designing Recommender Systems for Digital Humanities

DataRec Library for Reproducible in Recommend Systems

Shilling Attacks on Recommender Systems

SuperDataScience
last post: 3 days, 16 hours ago

1002: Fable 5: The Full Story from Capabilities to Drama

1001: How AI Erased My Career Moat, an Episode #1001 Special: Jon Krohn interviewed by Kirill Eremenko

1000: Ten Years of the Super Data Science Podcast, with Jon, Kirill and Special Guests

999: What's Left to Build When Software Is Free, with Chip Huyen

998: In Case You Missed It in May 2026

997: How This AI Startup Hit 20M Users (No Moat)

996: TrueFoundry’s Nikunj Bajaj on How to Get $100M Returns on AI Agent Deployments

995: End-to-End Foundation Models for the Energy Industry, with Jazmia Henry

994: AI’s Putting Recent Grads Out of Work; Here’s How to Get Hired Anyway!

993: How to Build AI-First Organizations, with Jacob Miller and Jeremy Mumford

992: Tokenmaxxing vs AI Hardware Bottlenecks

991: Pair Programming with AI in Your Python Notebook, with Dr. Trevor Manz

990: Inside Mythos: Anthropic's Locked-Down Frontier Model

989: Security for Mythos-Era Agentic Risks, with Rubrik’s Anneka Gupta and Cal Al-Dhubaib

988: In Case You Missed It in April 2026

Data Science at Home
last post: 1 month ago

Recommend and manipulate: the dangers of the attention economy

Social media is an ant mill (Internet is a disaster) (Ep. 303)

AI and videogames (Ep. 305)

AI and videogames: Conversational NPCs (Ep. 306)

AI tips & tricks (Ep. 307)

Europe, wake up! You Can’t Be a Superpower on Someone Else’s Servers (Ep. 304)

About Apple’s Privacy (Ep. 302)

Productivity is the new data breach (Ep. 301)

Programmable Money: The Cage They’ll Call Convenience (Ep. 300)

There Is No AI. There’s a Stateless Function on 10,000 GPUs Pretending to Know You (Ep. 299)

Bias in the machine (edited)

What is wrong with reinforcement learning? (Ep. 82)

How to generate very large images with GANs (Ep. 76)

Training neural networks faster without GPU [RB] (Ep. 77)

More powerful deep learning with transformers (Ep. 84) (Rebroadcast)