You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
🔎 A full-fledged search engine that crawls, indexes, perform analysis, and searches through the papaers listed on research gate website. It comes with a Flask webserver and a light-weight UI as well. It was a course project.
Documents and queries are represented as vectors. Each dimension corresponds to a separate term. If a term occurs in the document, its value in the vector is non-zero. Several different ways of computing these values, also known as (term) weights, have been developed. One of the best known schemes is tf-idf weighting (see the example below). The…
Coursework for CS 5154 - Information Retrieval. Dual level (grad & undergrad) course introducing information storage and retrieval with unstructured data. Includes concepts such as tf-idf, cosine similarity, relevance-based evaluation, text classification, clustering etc.