skip-gram word embedding model by C++
-
Updated
Oct 31, 2024 - C++
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
skip-gram word embedding model by C++
Bachelor thesis - language identification of short texts (2014)
Production-ready RAG system with Endee vector database featuring hybrid search (semantic + BM25), neural re-ranking, and Streamlit UI for AI/ML knowledge base querying
A Language Interpreter as semantiC Experiment in natural language processing
Simple command-line tool to identify the language of a given text using 2 different identification models.
LinearCosine: Adding beats multiplying for lower-precision efficient cosine similarity
AI-powered ticket classification system built with Django and Endee Vector Database. Automatically routes customer support tickets using semantic similarity and sentence-transformers.
High-performance C++ NLP and ML inference backend for fake news headline analysis, deployed with FastAPI and Docker.
(C++) Creates a short resume based on text
High-performance daemon for real-time codebase indexing. Generates semantic embeddings locally to provide AI agents with tools for instant search interface.
DaisyKit examples for Android
The Repository Contains The CPP Program to Calculate the Cosine Similarity Between two Documents Text
Tesseract Open Source OCR Engine (main repository)
Wrappers for FastText Library used for fast text representation and classification.
Created by Alan Turing