tokenizer
A grammar describes the syntax of a programming language, and might be defined in Backus-Naur form (BNF). A lexer performs lexical analysis, turning text into tokens. A parser takes tokens and builds a data structure like an abstract syntax tree (AST). The parser is concerned with context: does the sequence of tokens fit the grammar? A compiler is a combined lexer and parser, built for a specific grammar.
Here are 96 public repositories matching this topic...
An advanced tokenizer made with typescript
-
Updated
Sep 5, 2025 - TypeScript
this repository contient a minimal parser
-
Updated
Apr 26, 2020 - TypeScript
A parser generator.
-
Updated
Jul 14, 2022 - TypeScript
Interactive tool to understand how text is tokenized by language models. Visualize tokens, analyze efficiency, and optimize your LLM prompts.
-
Updated
Aug 26, 2025 - TypeScript
🪁 Parser grammar builder. Write parsers with pattern matching.
-
Updated
Oct 3, 2025 - TypeScript
A JSON parser made in typescript with tokenizer, ast, parser ...
-
Updated
May 7, 2024 - TypeScript
Lexical analyzer for JavaScript
-
Updated
Nov 16, 2022 - TypeScript
University Credits tokenazitation: Public good on-chain for global learning certification (ETH Global Istanbul 2023 Submission: prized). For university credits issued tokens and for less wealthy schools claiming an NFT for students evaluated succesfully.
-
Updated
Aug 27, 2024 - TypeScript
Design System construído com React, Storybook e Stitches, organizado em um monorepo para facilitar a colaboração e o gerenciamento.
-
Updated
Sep 2, 2024 - TypeScript
Interface interativa para simulação de pré-vendas de tokens e criação de lançamentos fictícios em tempo real.
-
Updated
Dec 1, 2025 - TypeScript
An extendable ChordPro parser.
-
Updated
Feb 24, 2018 - TypeScript
A wrapper library around https://github.com/takuyaa/kuromoji.js that intelligently groups Japanese morphemes into words
-
Updated
Feb 8, 2024 - TypeScript
Lightweight JSON tokenizer with symmetric dictionary encoding for compression and optimization
-
Updated
Nov 17, 2025 - TypeScript
A lightweight, fast, and extensible line-by-line content tokenizer and parser
-
Updated
Jul 29, 2025 - TypeScript
🇯🇵🍜 learner‑friendly Japanese tokenizer / segmenter
-
Updated
Apr 22, 2025 - TypeScript
Fast & Non-regex based lexical analysis for Deno.
-
Updated
Jan 11, 2021 - TypeScript
-
Updated
Apr 18, 2022 - TypeScript
- Followers
- 11k followers
- Website
- github.com/topics/parsing
- Wikipedia
- Wikipedia