Skip to content
@swiss-ai

swiss-ai

Popular repositories Loading

  1. mmore mmore Public

    Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets …

    Python 142 29

  2. apertus-tech-report apertus-tech-report Public

    Tech Report of the Apertus LLM Suite

    118 4

  3. pretrain-data pretrain-data Public

    Pretraining data reconstruction scripts for Apertus

    Python 93 4

  4. Megatron-LM Megatron-LM Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python 32 13

  5. MoE MoE Public

    some mixture of experts architecture implementations

    Python 22 3

  6. parity-aware-bpe parity-aware-bpe Public

    Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization [arXiv 2025]

    Python 15 3

Repositories

Showing 10 of 49 repositories
  • pretrain-data Public

    Pretraining data reconstruction scripts for Apertus

    swiss-ai/pretrain-data’s past year of commit activity
    Python 93 Apache-2.0 4 0 1 Updated Oct 9, 2025
  • swiss-ai/reasoning_getting-started’s past year of commit activity
    Shell 1 0 0 0 Updated Oct 8, 2025
  • mmore Public

    Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets and feed them to an LLM as a knowledge base? Well, MMORE is here to help you!

    swiss-ai/mmore’s past year of commit activity
    Python 142 Apache-2.0 29 23 8 Updated Oct 8, 2025
  • swiss-ai/model-spinning’s past year of commit activity
    Python 7 2 0 0 Updated Oct 8, 2025
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    swiss-ai/Megatron-LM’s past year of commit activity
    Python 32 3,220 6 18 Updated Oct 7, 2025
  • verl Public Forked from volcengine/verl

    verl: Volcano Engine Reinforcement Learning for LLMs

    swiss-ai/verl’s past year of commit activity
    Python 0 Apache-2.0 2,534 0 0 Updated Oct 6, 2025
  • swiss-ai/posttraining-data’s past year of commit activity
    Python 0 0 1 0 Updated Sep 30, 2025
  • lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    swiss-ai/lm-evaluation-harness’s past year of commit activity
    Python 1 MIT 2,790 0 2 Updated Sep 25, 2025
  • pretrain-code Public

    Pretraining codebase for Apertus models, based on Megatron-LM

    swiss-ai/pretrain-code’s past year of commit activity
    Shell 12 Apache-2.0 2 0 0 Updated Sep 25, 2025
  • evals Public
    swiss-ai/evals’s past year of commit activity
    Python 2 2 0 1 Updated Sep 25, 2025

Most used topics

Loading…