I have been building scalable data systems for over a decade. Here I am writing about my experiences, and I hope you find these posts useful. Your comments and feedback are greatly appreciated. — Rares Vernica

Elevate Your LLM Applications: A Deep Dive into Advanced RAG Techniques πŸš€

Building Production-Ready Systems with Advanced Retrieval Strategies

Retrieval Augmented Generation (RAG) has revolutionized how Large Language Models (LLMs) access and utilize external knowledge, moving beyond static training data to deliver more accurate, relevant, and up-to-date responses. While basic RAG setups are powerful, achieving peak performance and addressing complex real-world scenarios often requires a more sophisticated approach. This... [Read More]

Dual-Engine Serverless SQL Lakehouse

Powerful lakehouse with DuckDB, Neon PostgreSQL, and Google Cloud Storage πŸŒŠπŸ¦†πŸ˜

The concept of a data lakehouse is gaining serious traction, and for good reason! It combines the flexibility and cost-effectiveness of a data lake with the data management features and ACID transactions of a data warehouse. Let’s dive into building a practical lakehouse setup using some fantastic open-source and cloud-native... [Read More]

Machine Learning in SciDB

Leveraging the SciDB Stream plug-in

Popular data processing platforms offer users the ability to inject an external process into the data processing pipeline. The data flowing through the data pipeline is fed as input to the external process, while the output produced by the process is fed back into the pipeline. The external process runs... [Read More]