NEXUS rejects generic, repetitive content and delivers only what you truly care about ✅ — integrating real-world dynamic factors, leveraging multi-perspective reasoning to break conventional thinking patterns, and generating genuinely novel insights 💥. With a 60% faster analysis engine, NEXUS delivers deep intelligence in under 30 minutes at a fraction of the cost 🔍
NEXUS Framework: Domain Fine-Tuning × Deep Search × Attention-Driven Analysis × Digital Cognitive Twin — a Multi-Agent collaborative framework that breaks template-based analysis and reshapes intelligent content generation.
NEXUS bridges "information analysis" and "social system simulation", grounded in verified, real-world information flows.
NEXUS is a Multi-Agent Cognitive Simulation framework for complex real-world information systems. Driven by user Preference Anchors, NEXUS actively extracts deep insights and strategic intelligence aligned with individual cognitive goals. It systematically integrates high-fidelity, highly-relevant signals from social sentiment, policy dynamics, and financial intelligence — building an end-to-end simulation pipeline that reconstructs causal chains between key events in high-dimensional space, enabling dynamic scenario forecasting and genuine cognitive breakthroughs.
-
🚀 Beyond Generic Outputs — Personalized Cognitive Maps 🔥🔥🔥
- Deep search anchored to what users genuinely care about — not broad, generic information
- Builds a user-exclusive Cognitive Map rather than one-size-fits-all analysis
- 📖 Related: Farshidi et al. (2024), Understanding User Intent Modeling; Nguyen et al. (2018), Capsule Network Search Personalization
-
🔗 Domain-Specific Fine-Tuned Models over Generic LLMs 🔥🔥
- Integrates domain-adapted fine-tuned models — not off-the-shelf LLMs
- User attention anchor-driven information processing — not indiscriminate data handling
-
📈 Dynamic Trend Simulation over Static Prediction 🔥🔥
- Incorporates real-time, dynamic influence factors — not static or idealized assumptions
- Constructs a high-fidelity Digital Cognitive Twin world — not a simplified virtual environment
- 📖 Related: Guo et al. (2025), Beyond Static Retrieval in GraphRAG; Zhu et al. (2025), Conversational Intent-Driven GraphRAG
-
💡 Multi-Perspective Deep Insight — Jump Outside the Box 🔥🔥🔥
- Generates genuinely novel ideas — not rigid, template-driven analysis
- Produces vivid simulated internal monologues — not superficial surface-level outputs
- 📖 Related: Zhong et al. (2025), Beyond Sentiment; Saunders (2012), Autonomous Creative Systems
-
💡 Four Stakeholder Perspectives Simulated in Parallel 🔥🔥🔥
- ⚖️ Policymakers & Regulators — e.g., government bodies, think tanks
- 💰 Business & Market Actors — e.g., VC/PE firms, enterprises, market analysts
- 🧑🤝🧑 Public & Community — e.g., KOLs, NGOs, social opinion amplifiers
- 🧑🔬 Researchers & Experts — e.g., academics, industry specialists
-
🌟 60%+ Efficiency Gains over Prior Architecture 🔥🔥🔥
- Versus the prior
mirofisharchitecture: cost reduced by 60%, simulation time cut to 1/3 - Achieved via deep integration of background knowledge priors + user attention anchor focusing, dramatically reducing token consumption while preserving analytical depth
- Cost: Single simulation reduced from $5 → $2–3 (40–60% reduction)
- Speed: Only 30 simulation rounds needed vs. prior 96 rounds; total time 18–20 min vs. 50 min (>60% faster)
- Versus the prior
NEXUS supports multi-domain, multi-role, multi-perspective, high-fidelity simulation of complex information systems and future scenario forecasting.
- 🔮 Ripple Effect Forecasting: Dynamically simulate the economic, social, and public safety chain reactions following policy announcements
⚠️ Early Risk Detection: Identify latent social conflicts and public sentiment crises; output proactive intervention strategies- 📊 Causal Impact Evaluation: Rigorously decompose the causal effects of policy adjustments with evidence-backed analysis
- 🔮 Trend Intelligence: Fuse multi-source signals with user anchors to dynamically simulate industry inflection points and shifts in consumer mindset
- ⚔️ Competitive Landscape Simulation: Real-time modeling of competitor moves and market responses; assess the impact potential of new products, pricing, and campaigns
- 🧪 Strategy Validation: Run multi-scenario market simulations to pre-test revenue, market share, and brand reputation outcomes
- 📊 Emotion Heatmap Tracking: Real-time monitoring of social media discussions and the dynamic evolution of public sentiment
- 🎯 Key Voice Identification: Pinpoint opinion leaders (KOLs) and quantify their true influence weight on events
- ⚡ Sentiment Storm Early Warning: Anticipate potential viral events; simulate propagation scale, pathways, and risk levels
- 🧠 Knowledge Graph Construction: Integrate academic papers, policy reports, and cutting-edge technology into a queryable, reasoning-ready dynamic knowledge network
- 🔮 Frontier Trend Detection: Based on historical data and real-time signals, precisely capture the early emergence of new technologies, market opportunities, and research hotspots
💡 Not just prediction — simulation. Stay one cognitive step ahead, with every decision grounded in evidence.
| Step | Module | Description | Tags |
|---|---|---|---|
| 🎯 Step 1 | Domain-Adaptive Fine-Tuning | Focus on user attention anchors; enhance domain-specific information comprehension via LLaMA Factory Pro with EasyDataset augmentation + Ollama deployment | #fine-tuning #attention-anchors #local-deploy |
| 🤖 Step 2 | Customized Deep Search Agents | Extract high-fidelity, user-goal-aligned deep insights via UPAIRS-Agents (pressure-driven iterative reflection) + CognitiveTemp-DeepSearch-Agents (user-controllable search) | #deep-search #user-control #high-fidelity |
| 🤖 Step 3 | Attention-Driven Analysis Agents | Real-time user focus capture via Google Hypothesis; four-agent chain (Signal Parsing → Evidence Review → Trend Inference → Strategic Synthesis) transforms fragmented data into actionable intelligence | #attention-driven #chain-of-thought #trend-strategy |
| 🌍 Step 4 | Digital Cognitive Twin — Canyon 🏜 | GraphRAG-based knowledge graph; diverse virtual agent generation; environment factor injection for sentiment dynamics; multi-round simulation + event development reports + simulated interviews | #digital-twin #sentiment-simulation #virtual-interview |
| 🔁 Step 5 | End-to-End Cognitive Reasoning Loop | Integrates all modules; supports multi-role, multi-perspective cognitive simulation from real-world information flows to future scenario forecasting | #end-to-end #scenario-forecasting |
| Module | Function | Implementation |
|---|---|---|
| 🎯 Domain Fine-Tuning | Focus on User Attention Anchors; elevate domain-specific understanding | LLaMA Factory Pro — data augmentation + LoRA/QLoRA + Ollama deployment |
| 🤖 Custom Deep Search Agents | Extract insights aligned with user cognitive goals | UPAIRS-Agents (relevance & confidence) + CognitiveTemp-DeepSearch (user-anchor-driven) |
| 🤖 Attention-Driven Analysis Agents | Real-time focus capture; chain-analysis + strategic integration | Google Hypothesis annotation → Agent-A (signal parsing) → B (evidence review) → C (trend inference) → D (strategic synthesis) |
| 🌍 Digital Cognitive Twin — Canyon 🏜 | High-fidelity virtual environment; real-world sentiment reproduction; deep simulation | GraphRAG knowledge graph → virtual agent generation → environment factor injection → multi-round simulation → simulated interviews |
Goal: Anchor on user attention points; enhance model comprehension for specific domains (e.g., current affairs, social discourse, financial news, policy documents).
All-in-One Pipeline: Data Management → Data Augmentation → Model Fine-Tuning → Model Quantization → Local Deployment 📦
Powered by the personal pipeline project LLaMA Factory Pro:
- Input: User-defined unstructured data (PDFs, news articles, policy documents, etc.) → fully automated QA-pair generation
- Output: Enhanced fine-tuned model + online monitoring + deployment
1️⃣ Cloud Platform AutoDL ☁️ — Automated fine-tuning environment (Docker image config), .venv one-stop management, cloud storage support (AWS S3, Azure Blob, Hugging Face Hub, Ollama Registry)
2️⃣ Data Engineering — EasyDataset: auto-detects multimodal formats (text/image/mixed), auto-generates high-quality context-aware QA pairs
3️⃣ Training — Hyperparameter configuration → online supervised fine-tuning monitoring → auto-save to cloud storage upon completion
4️⃣ Deployment — Local model serving via Ollama / llama.cpp
Goal: Provide high-quality analytical priors for NEXUS through deep search, user-focus anchoring, multi-agent reasoning, and structured information modeling.
This module consists of two components: UPAIRS-DS (Pressure-Driven) and CognitiveTemp-DeepSearch Agents.
|
|
🔎 UPAIRS Deep Search Engine Built on UPAIRS Agents, this engine uses a Pressure-Driven paragraph-level iterative reflection architecture to continuously refine retrieval paths in complex information environments, ensuring high factual fidelity. 🔎 CognitiveTemp-DeepSearch Users can further customize search strategies via CognitiveTemp-DeepSearch-Agents to obtain results with stronger relevance, reliability, and timeliness. Key mechanisms:
|
|
Figure 3: Deep Search Interface (left) — Query Input and Search Result Exploration (right) |
|
This module is composed of three sub-modules:
(1) User Attention Anchoring → (2) Multi-Agent Deep Analysis → (3) Structured Knowledge Output
Users perform targeted deep searches (e.g., academic topics like GraphRAG or social events) with customizable parameters: recency, relevance threshold, and source selection.
User Focus Annotation — Via the Hypothesis API Google Plugin, users highlight content of interest and add free-form natural language annotations (notes, questions, doubts). The system captures all highlights and annotations in real time, outputting structured JSON / JSONL data.
Figure 7: GraphRAG Search Detail — Entity & Relation Visualization (left); Social Opinion Search — Semantic & Context Highlight (right)
| Agent | Role | Responsibility | Input | Output |
|---|---|---|---|---|
| 🤖-A | Trend Signal Analyst | Extract trend signals, evolution paths, and attention heat from user focus data | User annotation JSON | Trend signal report |
| 🤖-B | Content Evidence Reviewer | Extract technical evolution evidence, timelines, and key milestones from page content | Page content JSON | Technical evolution report |
| 🤖-C | Event-Trend Inference Specialist | Infer technology diffusion paths and industry adoption dynamics from related events | Related event JSON | Event trend correlation report |
| 🤖-D | Chief Strategic Synthesizer | Integrate A+B+C outputs; generate strategic document for trend forecasting | Reinforced reports (A+B+C) | High-quality strategic report (high relevance, timeliness, fidelity, customization) |
In one line:
Agent-A/B/C Chain Analysis + Agent-D Strategic Synthesis → High-relevance, high-timeliness, high-fidelity, highly-customized strategic reports
Analysis mode control: Express / Balanced / Deep | Relevance threshold configurable
Figure 8: Report Generation Interface (left) — User Input & Configuration; Generated Report Result (right) — Structured Summary & Insights
Goal: Within a high-fidelity virtual digital twin environment, simulate real-world public sentiment, event development, and user attention dynamics through multi-agent interactions for deep insight and predictive forecasting.
User-defined attention anchors (topics, key entities, specific events) are fused with multi-source information. Structured entity-relation extraction builds an enhanced GraphRAG representation with self-loop relations and multi-dimensional links, improving semantic depth and traceability.
# Entity & Relation Extraction Example (GraphRAG Academic Scenario)
Technical Foundations
├─ RAG (Retrieval-Augmented Generation)
├─ Knowledge Graph
│ ├─ is_a ...
│ └─ used_in ...
Frameworks
├─ GraphRAG
│ ├─ uses → RAG
│ └─ built_on → Knowledge Graph
Application Systems
├─ MED-COPILOT
│ ├─ uses → GraphRAG
│ ├─ uses → Knowledge Graph
│ └─ has_risk → ⚠️ Hallucination / Calibration Bias
...| Figure 10a: Graph Build — Initial stage: entity and relation import. | Figure 10b: Graph Expansion — Adding links and multi-dimensional connections. |
| Figure 10c: Graph Visualization — Global view of all nodes and relationship network structure. | Figure 10d: Complete Knowledge Graph — Full graph supporting multi-hop reasoning and querying. |
Based on the information priors from Step 1, the high-fidelity virtual world Canyon is initialized with trending topics, contextual background, and activation sequences. Multiple virtual agents are generated, each with unique knowledge bases, cognitive structures, behavioral tendencies, and data associations.
Each virtual agent persona includes:
- Basic Attributes: Age, occupation, domain expertise
- Cognitive Characteristics: Information processing style, decision-making approach, risk preference
- Behavioral Tendencies: Social inclination, information-sharing habits
- Influence Level: Scope and weight of impact within the simulated ecosystem
Sample Virtual Agents (GraphRAG Academic Scenario):
| Virtual Agent | Identity | Basic Attributes | Cognitive Characteristics | Behavioral Tendencies | Influence |
|---|---|---|---|---|---|
| edged732 @Edge, D. | AI researcher, specializing in GraphRAG | Age 35, Researcher | Prefers logical reasoning and graph structure analysis; rigorous decision style; moderate risk tolerance | Highly active, frequently shares academic resources, participates in technical community discussions | Medium-High — GraphRAG & knowledge graph domain |
| graphragArXiv_953 @Arxiv | GraphRAG research dissemination platform | Institutional agent, research-focused | Literature-integration approach, neutral decision-making | Pushes research outputs, monitors academic landscape | High — GraphRAG research community |
| Dev_808 @Developer | LLM application development technical community | Community agent, AI sharing-focused | Values practical learning; experimental decision approach | Frequently shares code, models, and projects; highly interactive | Medium-High — efficient technical community propagation |
| neo4j_650 @neo4j | Graph database technology provider | Enterprise agent, database expert | Data-structure oriented; emphasizes complete knowledge graph optimization | Shares best practices; provides technical consulting and tooling support | High — knowledge graph & enterprise application scenarios |
| Readers_792 @Reader | AI algorithm engineer | User-type agent, algorithm research focus | Prefers algorithm understanding and engineering implementation; technology-driven decision making | Follows technical news, engages in community discussions and learning | Medium — indirect ecosystem influence as information receiver and feedback provider |
| Figure 11a: Agent Persona Build — Initial stage: basic attribute definition. | Figure 11b: Agent Persona Expansion — Adding preferences and behavioral inclinations. |
| Figure 11c: Agent Persona Refinement — Adjusting cognitive strategies and interaction styles. | Figure 11d: Agent Persona Completion — Full cognition and behavior patterns seeded from real-world data. |
Within the Canyon digital twin world, the system simulates the evolution of real-world public sentiment by injecting environmental factors to guide information flows and opinion dynamics. This step models sentiment evolution, hot-topic aggregation, and information diffusion patterns.
| Figure 12a: Hot Topic Detection — Identifying high-attention topics within the Canyon virtual world. | Figure 12b: Canyon Simulation Overview — System-wide operation and event flow structure. |
| Figure 12c: Virtual Agent Dialogue Sample 1 — Agent-to-agent discussion of trending topics. | Figure 12d: Virtual Agent Dialogue Sample 2 — Diverse agent reactions and cognitive simulations. |
The system conducts multi-round iterative simulation, generating event development reports covering outbreak dynamics, sentiment diffusion trends, and critical node analysis. Each virtual agent's reactions and behavior patterns are recorded, enabling deep insight queries and predictive analysis.
This project compares two contrasting scenarios:
- Academic Scenario: GraphRAG technology development
- Social Media Scenario: Bilibili streamer account-destruction incident
The academic research scenario (GraphRAG technology) and the social media scenario (Bilibili account incident) represent two informationally distinct ecosystems. Through Agent Interview Simulation in Canyon, agents exhibited markedly different cognitive frameworks, information processing approaches, reasoning pathways, and behavioral responses to identical topics or events. Academic agents tended toward structured knowledge integration, technical evolution analysis, and theoretical reasoning, while social media agents focused on emotional propagation, collective responses, and opinion evolution dynamics. This contrastive simulation reveals the underlying cognitive and propagation mechanisms across different information domains.
After simulated interviews and deep analysis, users can interactively engage with the generated virtual world reports and individual virtual agents. Through multi-turn dialogue and strategy exploration, users gain access to each virtual entity's latent reasoning paths, internal concerns, and projected behavioral trends.
Structured Comparative Analysis — Academic vs. Social Media:
| Dimension | Academic — GraphRAG | Social Media — Bilibili Incident |
|---|---|---|
| Research Theme | Deep fusion of Knowledge Graph and LLM; complex knowledge reasoning, multi-hop retrieval, and explainable AI deployment | Virtual asset loss triggering platform governance debate; analysis of opinion propagation and platform responsibility boundaries |
| Core Problem | How GraphRAG architecture enables structured knowledge retrieval, multi-hop reasoning, and explainability to enhance LLM decision support in professional domains | Legal characterization of virtual accounts and digital assets; streamer behavioral norms; platform governance accountability |
| Current Stage | Technology Readiness Level TRL 5–6 → TRL 6–7: transitioning from lab validation to enterprise application exploration | Public discourse shifting from emotional condemnation to institutional and legal-level discussion |
| Key Drivers | Multimodal knowledge integration; RL-optimized retrieval; incremental knowledge graph updates with real-time data fusion | Streamer conduct norms; legal characterization of virtual property; platform accountability and user rights protection |
| Industry Impact | Priority deployment in healthcare, finance, and legal sectors; driving enterprise knowledge systems and AI decision platforms | Driving live streaming conduct standards; establishing virtual asset dispute resolution mechanisms |
| Typical Case | A top-tier securities firm built a compliance review system with GraphRAG, reducing review time from 3–5 days to 2 hours; accuracy improved from 78% to 92% | Account destruction event caused follower data to be wiped, triggering tens of millions of views in public discourse |
| Long-Term Trend | GraphRAG positioned as core infrastructure for enterprise AI knowledge systems; driving LLM evolution toward explainable, verifiable professional decision systems | Driving improvements to virtual asset protection and platform governance; digitalecosystem moving toward greater regulation and institutionalization |
Official Position vs. Inner Monologue (NEXUS's unique insight capability — revealing authentic sentiment):
| Scenario | Official Position (Public-Facing) | Inner Monologue (Authentic Sentiment) |
|---|---|---|
| Academic / GraphRAG Financial Institution Agent |
From an exchange data perspective, GraphRAG has demonstrable commercial cases in finance, primarily at leading institutions. Industry sentiment is cautiously optimistic — acknowledging technical potential while maintaining prudent expectations for large-scale commercialization. Example: a top brokerage used GraphRAG to optimize compliance review, reducing manual review from 3–5 days to 2 hours, improving accuracy from 78% to 92%, saving 30% in labor costs. Institutional feedback segments into three layers: tech teams (most enthusiastic, testing cross-market arbitrage), risk/compliance teams (pragmatic, focused on accuracy and explainability), and business units (conservative, requiring clear ROI evidence). | Honestly, this question hits close to home. In exchange data services, we're always both excited and afraid of new technology. Our firm started internal GraphRAG testing last year — calling it "revenue-generating" might be premature, but the commercial value is real. A mid-tier broker deployed it and can now rapidly identify upstream/downstream risk transmission paths, boosting analyst efficiency by 30%. Last month at a closed-door fintech session, the consensus was that GraphRAG is still in "technical validation mode." A friend summed it up well: "The concept is sexy, but putting it in a live trading system? I can't sleep at night." The hallucination rate issue is real — finance demands near-perfect accuracy. |
| Social Media / Bilibili Platform Policy Agent |
Our position is clear: rather than abolishing QR-code login, we advocate for optimizing platform security and elevating risk awareness among both streamers and users. QR-code login offers real convenience but carries risk. We recommend: adding pre-scan risk warning dialogs; establishing a "temporary authorization" mechanism limiting post-scan operational permissions; providing streamers with audit trail functionality. A blanket ban is not feasible, but stricter governance is necessary. | Honestly, this issue is genuinely complicated. Donations, follower data — all wiped overnight — there's no outlet for that grief. We issued a statement calling for platform accountability, but the impact was limited. I know many small streamers who built their entire following through QR-code interactions. Xiao Wei grew 5,000 followers in a month that way — that's her livelihood. I hope platforms build comprehensive protection mechanisms rather than simply banning the feature. Virtual property safety must be guaranteed. |
📑 Detailed analytical reports:
-
Nguyen, N. T., et al. (2018). A Capsule Network-based Embedding Model for Search Personalization. arXiv:1804.04266. 📄 https://arxiv.org/abs/1804.04266
-
Farshidi, F., et al. (2024). Understanding User Intent Modeling for Conversational Recommender Systems. User Modeling and User-Adapted Interaction, Springer Nature. 🔗 https://link.springer.com/article/10.1007/s11257-024-09398-x
-
Guo, Q., et al. (2025). Beyond Static Retrieval: Opportunities and Pitfalls of Iterative Retrieval in GraphRAG. arXiv:2509.25530. 📄 https://arxiv.org/abs/2509.25530
-
Zhu, Y., et al. (2025). Conversational Intent-Driven GraphRAG. arXiv:2506.19385. 📄 https://arxiv.org/abs/2506.19385
-
Zhong, V., et al. (2025). Beyond Sentiment: Cognitive and Narrative Open-Ended Generation. arXiv:2502.13925v1. 📄 https://arxiv.org/html/2502.13925v1
-
Saunders, R. (2012). Towards Autonomous Creative Systems: A Computational Approach. Cognitive Computation, 4(3), 216–225. 🔗 https://www.researchgate.net/publication/257788397
| Repository | Description |
|---|---|
| Microsoft GraphRAG | Knowledge Graph-enhanced Retrieval-Augmented Generation for complex multi-hop reasoning |
| LLaMA Factory Pro | Enterprise-grade LLaMA fine-tuning pipeline: data augmentation + LoRA/QLoRA + Ollama deployment |
| UPAIRS-Agents | Pressure-Driven paragraph-level iterative reflection deep search architecture |
| CognitiveTemp-DeepSearch-Agents | User attention anchor-driven customizable deep search agents |
| DeepSearchAgent-Demo | Multi-step deep search agent demo |
| Repository | Description |
|---|---|
| Hypothesis h-api | Web & PDF annotation API for real-time user attention capture |
| Ollama | Local LLM deployment framework supporting diverse model inference |
| LoRAFusion | Multi-LoRA weight fusion toolkit for model extension |
| llama.cpp | Lightweight, efficient LLaMA C/C++ inference library |
| easy-dataset | Streamlined dataset construction, conversion, and splitting |
@article{nguyen2018capsule,
title={A Capsule Network-based Embedding Model for Search Personalization},
author={Nguyen, Nam Tran and et al.},
journal={arXiv preprint arXiv:1804.04266},
year={2018}
}
@article{farshidi2024understanding,
title={Understanding User Intent Modeling for Conversational Recommender Systems},
author={Farshidi, F. and et al.},
journal={User Modeling and User-Adapted Interaction},
year={2024},
doi={10.1007/s11257-024-09398-x}
}
@article{guo2025graphrag,
title={Beyond Static Retrieval: Opportunities and Pitfalls of Iterative Retrieval in GraphRAG},
author={Guo, Q. and et al.},
journal={arXiv preprint arXiv:2509.25530},
year={2025}
}
@article{zhu2025conversational,
title={Conversational Intent-Driven GraphRAG},
author={Zhu, Y. and et al.},
journal={arXiv preprint arXiv:2506.19385},
year={2025}
}
@article{zhong2025beyond,
title={Beyond Sentiment: Cognitive and Narrative Open-Ended Generation},
author={Zhong, V. and et al.},
journal={arXiv:2502.13925v1},
year={2025}
}
@article{saunders2012autonomous,
title={Towards Autonomous Creative Systems: A Computational Approach},
author={Saunders, R.},
journal={Cognitive Computation},
volume={4},
number={3},
pages={216--225},
year={2012}
}