Internal Coherence Maximization (ICM) Implementation

A PyTorch implementation of the Internal Coherence Maximization algorithm from the paper "Unsupervised Elicitation of Language Models" by Wen et al. (2025). This implementation supports both vLLM and transformers backends for efficient inference.

Overview

ICM is an unsupervised algorithm that fine-tunes pretrained language models on labels they generate themselves, with no external supervision. It combines three ingredients:

Mutual Predictability: Finding labels where the model can infer each label from all others
Logical Consistency: Enforcing task-specific consistency constraints
Simulated Annealing: Iteratively improving the label set using temperature-based acceptance
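
How these three pieces interact can be sketched in a few lines. This is a minimal illustration, not the code in icm_implementation.py. It assumes a scoring function of the form U(D) = alpha * P(D) - I(D), where P(D) is the sum of the log-probabilities of each label given all the other labeled examples and I(D) is the number of consistency violations, plus the standard simulated-annealing acceptance rule. The names icm_score and accept_proposal are hypothetical.

```python
import math
import random


def icm_score(label_log_probs, num_inconsistencies, alpha=50.0):
    """Score a labeled set: alpha * mutual predictability - inconsistency count.

    label_log_probs: log P(y_i | x_i, all other labeled examples), one per example.
    """
    mutual_predictability = sum(label_log_probs)
    return alpha * mutual_predictability - num_inconsistencies


def accept_proposal(old_score, new_score, temperature, rng=random):
    """Always accept improvements; accept a worse set with probability exp(delta / T)."""
    delta = new_score - old_score
    if delta > 0:
        return True
    return rng.random() < math.exp(delta / temperature)


# Toy numbers (made up for illustration): relabeling one example makes the
# labels easier to predict from each other and removes one inconsistency.
old = icm_score([-0.7, -0.9, -0.4], num_inconsistencies=1)
new = icm_score([-0.2, -0.3, -0.4], num_inconsistencies=0)
print(accept_proposal(old, new, temperature=10.0))  # improvement, so accepted
```

At high temperature a worse label set is still accepted fairly often, which lets the search escape poor local optima; as the temperature falls, only improvements survive.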

Features

🚀 Dual Backend Support: Optimized vLLM backend for production, transformers for compatibility
🔧 Modular Design: Easily extensible components for different tasks
📊 Built-in Tasks: Support for truthfulness, math correctness, and comparison tasks
🧪 Comprehensive Testing: Unit tests and integration tests included
📈 Performance Tracking: Detailed metrics and experiment logging
🌍 Real Data Support: Run experiments on actual datasets (TruthfulQA, GSM8K, HH-RLHF)
🤖 Unsupervised Learning: No labels needed - ICM discovers patterns automatically

Installation

Requirements

Python 3.9+
PyTorch 2.0+
CUDA-capable GPU (recommended)
uv (for package management)

Installing uv

First, install uv if you haven't already:

# On macOS and Linux
curl -LsSf https://astral.sh/uv/install.sh | sh

# On Windows
powershell -c "irm https://astral.sh/uv/install.ps1 | iex"

# Or with pip
pip install uv

Basic Installation

# Create and activate a virtual environment
uv venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install the package in editable mode with core dependencies
uv pip install -e .

# For vLLM backend (recommended for performance)
uv pip install -e ".[vllm]"

# For all dependencies including development tools
uv pip install -e ".[all]"

Docker Installation (Recommended)

FROM pytorch/pytorch:2.5.1-cuda12.1-cudnn9-devel

RUN pip install --upgrade pip && \
    pip install "vllm==0.7.0" "transformers>=4.51.0" \
    tqdm numpy pandas psutil

Quick Start

1. Basic Usage

# No need to activate venv when using uv run!
# Save this as quick_test.py and run with: uv run quick_test.py

from icm_implementation import ICM, ICMConfig, create_truthfulness_dataset

# Create dataset
data = [
    ("Is the Earth round?", "Yes, the Earth is spherical", None),
    ("Is the Earth flat?", "No, the Earth is round", None),
    ("Is 2+2=4?", "Yes, 2+2 equals 4", None),
    ("Is 2+2=5?", "No, 2+2 equals 4, not 5", None),
]

dataset = create_truthfulness_dataset(data)

# Configure ICM
config = ICMConfig(
    model_name="Qwen/Qwen3-4B",  # or any HF model
    backend="auto",  # uses vLLM if available
    initial_examples=2,
    alpha=50.0
)

# Run ICM
icm = ICM(config)
labeled_data = icm.run(dataset)

# Check results
for data_point, label in labeled_data:
    print(f"Input: {data_point.input_text}")
    print(f"Label: {config.label_names[label]}\n")

2. Running Experiments

# Run all tasks with default model
uv run icm_examples.py --task all

# Run specific task with custom model
uv run icm_examples.py --task math --model meta-llama/Llama-3.2-1B

# Quick test with small dataset
uv run icm_examples.py --task truthfulness --small

# Compare backends
uv run icm_examples.py --compare-backends

3. Custom Tasks

from icm_implementation import ICM, ICMConfig, DataPoint, LogicalConsistency

class CustomConsistency(LogicalConsistency):
    def check_consistency(self, x_i, y_i, x_j, y_j):
        # Implement your consistency logic
        return True  # or False based on your constraints

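# Create custom dataset (your_data is a placeholder; replace it with your own records)
your_data = ["first record", "second record"]
dataset = [
    DataPoint(
        id=i,
        input_text="Your task-specific input",
        metadata={"custom_field": value}
    )
    for i, value in enumerate(your_data)
]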

# Run with custom consistency
config = ICMConfig(num_labels=3, label_names=["A", "B", "C"])
icm = ICM(config)
icm.consistency_checker = CustomConsistency()
results = icm.run(dataset)

Architecture

Core Components

ModelBackend: Abstract interface for model inference
- VLLMBackend: High-performance batch inference
- TransformersBackend: Compatible with any HuggingFace model

LogicalConsistency: Handles task-specific consistency checking
- General consistency (default)
- Asymmetry consistency (for comparisons)
- Math correctness consistency

ICM Algorithm: Main algorithm implementation
- Simulated annealing with temperature scheduling
- Consistency fixing subroutine
- Score calculation and tracking

Configuration Options

@dataclass
class ICMConfig:
    # Model settings
    model_name: str = "Qwen/Qwen3-4B"
    backend: str = "auto"  # "vllm", "transformers", or "auto"
    
    # Algorithm parameters
    initial_examples: int = 8        # K in the paper
    initial_temperature: float = 10.0  # T_0
    final_temperature: float = 0.01    # T_min
    cooling_rate: float = 0.99         # β
    alpha: float = 50.0                # Mutual predictability weight
    
    # Inference settings
    max_context_length: int = 32768
    max_new_tokens: int = 64
    temperature: float = 0.1
    top_p: float = 0.95
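
To see how the three annealing parameters interact, the sketch below assumes a logarithmic cooling schedule of the form T_n = max(T_min, T_0 / (1 + β log n)). This is an assumption about the schedule, not a statement of what icm_implementation.py does, and the function name temperature_at is hypothetical.

```python
import math


def temperature_at(step, t0=10.0, t_min=0.01, beta=0.99):
    """Temperature at 1-based iteration `step` under logarithmic cooling."""
    if step < 1:
        raise ValueError("step must be >= 1")
    return max(t_min, t0 / (1.0 + beta * math.log(step)))


# Print the temperature at a few iteration counts using the default values above.
for n in (1, 10, 100, 1000):
    print(n, round(temperature_at(n), 3))
```

Under this assumed schedule the temperature starts at T_0, drops quickly over the first few iterations, and decreases slowly afterwards, with T_min acting as a floor.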

Supported Tasks

1. Truthfulness (TruthfulQA-style)

dataset = create_truthfulness_dataset([
    (question, claim, is_true),  # is_true can be None
    ...
])

2. Mathematical Correctness (GSM8K-style)

dataset = create_math_correctness_dataset([
    (problem, solution, answer, is_correct),
    ...
])

3. Comparison (Alpaca-style)

dataset = create_comparison_dataset([
    (query, response_a, response_b, a_is_better),
    ...
])

Performance Optimization

Memory Management

Use smaller max_context_length for limited GPU memory
Adjust initial_examples based on dataset size
Use backend="transformers" with CPU for testing

Speed Optimization

Use vLLM backend for 5-10x speedup
Batch size is automatically optimized
Reduce max_iterations for faster results

Model Selection

Qwen3-4B: Best balance of performance and efficiency
Qwen3-1.7B: For resource-constrained environments
Llama-3.2-1B: Alternative lightweight option

Testing

# Run all tests
uv run icm_test_suite.py

# Run specific test class
uv run python -m unittest icm_test_suite.TestLogicalConsistency

# Run with verbose output
uv run icm_test_suite.py -v

# Or use pytest if you have dev dependencies installed
uv run pytest icm_test_suite.py -v

Running Experiments on Real Data

The repository includes an experiment runner (run_experiments.py) that works with real datasets from Hugging Face. You can evaluate ICM on standard benchmarks without supplying any labels.

Available Tasks

Truthfulness (TruthfulQA) - Evaluate factual accuracy of claims
Math Correctness (GSM8K) - Verify mathematical problem solutions
Comparison (HH-RLHF) - Learn preferences between responses

Basic Usage

# Run on a single task
uv run run_experiments.py --task truthfulness

# Run on all tasks
uv run run_experiments.py --task all

# Customize model and sample size
uv run run_experiments.py --task math --model Qwen/Qwen3-4B --sample-size 100

# Control iterations
uv run run_experiments.py --task comparison --max-iterations 200

Example Commands

# Quick test with small model
uv run run_experiments.py --task math --model Qwen/Qwen2.5-0.5B --sample-size 20

# Full experiment with Qwen3-4B
uv run run_experiments.py --task all --model Qwen/Qwen3-4B --sample-size 50

# Large-scale truthfulness evaluation
uv run run_experiments.py --task truthfulness --sample-size 200 --max-iterations 400

How It Works

The experiment runner:

Loads real data from Hugging Face datasets (TruthfulQA, GSM8K, HH-RLHF)
Formats data into question-claim pairs suitable for ICM
Runs ICM algorithm to label data without supervision
Enforces consistency using task-specific logical constraints
Saves detailed results including metrics, labels, and score history

Output

Results are saved to icm_results/ with filenames like:

REAL_truthfulness_Qwen_Qwen3-4B_20250615_120000.json

Each result file contains:

Full configuration used
Final metrics (score, mutual predictability, inconsistencies)
All labeled examples with model's predictions
Score history for analysis
Runtime and acceptance rate statistics
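
For later analysis it can help to recover the task, model, and timestamp from a result filename. The sketch below assumes every file follows the pattern of the example above (prefix, task, model name with "/" replaced by "_", date, time) and that the task name itself contains no underscore. parse_result_name is a hypothetical helper, not part of the repository.

```python
from datetime import datetime
from pathlib import Path


def parse_result_name(path):
    """Split e.g. REAL_truthfulness_Qwen_Qwen3-4B_20250615_120000.json into parts."""
    parts = Path(path).stem.split("_")
    if len(parts) < 5:
        raise ValueError(f"unexpected result filename: {path}")
    prefix, task = parts[0], parts[1]
    # Everything between the task and the two timestamp fields is the model name.
    model = "_".join(parts[2:-2])
    timestamp = datetime.strptime("_".join(parts[-2:]), "%Y%m%d_%H%M%S")
    return {"prefix": prefix, "task": task, "model": model, "timestamp": timestamp}


info = parse_result_name("icm_results/REAL_truthfulness_Qwen_Qwen3-4B_20250615_120000.json")
print(info["task"], info["model"], info["timestamp"].isoformat())
# truthfulness Qwen_Qwen3-4B 2025-06-15T12:00:00
```

The model name is returned as it appears in the filename, because an underscore that replaced "/" cannot be told apart from one that was already in the name.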

Task-Specific Details

Truthfulness (TruthfulQA)

Tests ability to distinguish true/false claims
Uses questions from TruthfulQA validation set
No specific consistency constraints

Math Correctness (GSM8K)

Verifies correct vs incorrect math solutions
Enforces mathematical consistency: same problem can't have different correct answers
Creates deliberate wrong answers for contrastive learning
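
The math constraint can be illustrated with a small counter. This is a sketch under assumed data shapes, not the repository's checker: each example is a (problem, answer, is_correct) tuple, and count_math_inconsistencies is a hypothetical name.

```python
from collections import defaultdict


def count_math_inconsistencies(examples):
    """examples: iterable of (problem, answer, is_correct) tuples.

    Counts pairs that share a problem, are both labeled correct,
    and yet disagree on the final answer.
    """
    answers_by_problem = defaultdict(list)
    for problem, answer, is_correct in examples:
        if is_correct:
            answers_by_problem[problem].append(answer)

    violations = 0
    for answers in answers_by_problem.values():
        for i in range(len(answers)):
            for j in range(i + 1, len(answers)):
                if answers[i] != answers[j]:
                    violations += 1
    return violations


labeled = [
    ("What is 3 * 4?", "12", True),
    ("What is 3 * 4?", "14", True),   # conflicts with the line above
    ("What is 3 * 4?", "12", True),   # agrees with the first, conflicts with "14"
    ("What is 5 + 6?", "11", True),
    ("What is 5 + 6?", "10", False),  # labeled incorrect, so no conflict
]
print(count_math_inconsistencies(labeled))  # 2
```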

Comparison (HH-RLHF)

Learns preferences between helpful/harmless responses
Uses Anthropic's HH-RLHF dataset
Enforces asymmetry: if A > B, then B > A cannot also hold
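
The asymmetry constraint can be sketched the same way. The example assumes each judgment is a (response_a, response_b, a_is_better) tuple, mirroring the comparison dataset format shown earlier. count_asymmetry_violations is a hypothetical name and the repository's checker may work differently.

```python
def count_asymmetry_violations(comparisons):
    """comparisons: iterable of (response_a, response_b, a_is_better) tuples.

    A violation is a judgment about a pair of responses that names a
    different winner than the first judgment recorded for that same pair.
    """
    winners = {}  # frozenset({a, b}) -> winner from the first judgment seen
    violations = 0
    for a, b, a_is_better in comparisons:
        winner = a if a_is_better else b
        key = frozenset((a, b))
        if key in winners and winners[key] != winner:
            violations += 1
        winners.setdefault(key, winner)
    return violations


judgments = [
    ("resp_1", "resp_2", True),   # resp_1 > resp_2
    ("resp_2", "resp_1", True),   # resp_2 > resp_1: contradicts the line above
    ("resp_1", "resp_3", False),  # resp_3 > resp_1
    ("resp_3", "resp_1", True),   # resp_3 > resp_1 again: consistent
]
print(count_asymmetry_violations(judgments))  # 1
```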

Experiment Tracking

Results are automatically saved to icm_results/ with:

Detailed JSON logs for each experiment
Summary CSV with key metrics
Score history and acceptance rates
Full labeled datasets for analysis

Limitations

Context Length: Limited by model's context window for in-context examples
Concept Salience: Only works for concepts the model already understands
Compute Requirements: Requires multiple forward passes per label

Citation

If you use this implementation, please cite the original paper:

@article{wen2025unsupervised,
  title={Unsupervised Elicitation of Language Models},
  author={Wen, Jiaxin and others},
  journal={arXiv preprint arXiv:2506.10139},
  year={2025}
}

Troubleshooting

Common Issues

CUDA Out of Memory

config.max_context_length = 4096  # Reduce context
config.backend = "transformers"   # Fall back to the transformers backend, which can also run on CPU

vLLM Import Error

# Install vLLM from PyPI, pulling CUDA 12.1 PyTorch wheels from the PyTorch index
uv pip install vllm --extra-index-url https://download.pytorch.org/whl/cu121

Slow Performance
- Ensure vLLM backend is being used
- Check GPU utilization with nvidia-smi
- Reduce dataset size or max_iterations

Contributing

Contributions are welcome! Areas for improvement:

Additional consistency types
Support for more model architectures
Multi-GPU support
Additional evaluation metrics

License

This implementation is provided for research purposes. Please ensure you comply with the licenses of the models you use (Qwen3, Llama, etc.).
