Skip to content
View sunshiding's full-sized avatar

Block or report sunshiding

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 22 2 Updated Feb 3, 2024

This is the repository for the paper 'Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection' (AAAI2025)

Python 7 1 Updated Apr 5, 2025

Unified Automated Evaluation for Hallucination Detection and Fact Verification

Python 6 Updated Oct 29, 2025
JavaScript 177 8 Updated Sep 10, 2024

Collection of scripts and notebooks for OpenAI's latest GPT OSS models

Jupyter Notebook 485 53 Updated Aug 25, 2025

Code for paper Towards Mitigating LLM Hallucination via Self Reflection

Python 30 7 Updated Oct 9, 2023

This is the code for the paper "Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation".

Python 37 3 Updated Sep 1, 2025

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 532 66 Updated Jan 17, 2025

CEduMEval : A Chinese educational multi-task evaluation benchmark

Python 13 Updated Nov 18, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 15,881 2,274 Updated Sep 3, 2025

SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models

Python 587 73 Updated Jun 26, 2024

GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.

Python 697 52 Updated Jan 7, 2025

👮‍♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基于 DFA 算法实现的高性能 java 敏感词过滤工具框架。内置支持单词标签分类分级。请勿发布涉及政治、广告、营销、翻墙、违反国家法律法规等内容。高性能敏感词检测过滤组件,附带繁体简体互换,支持全角半角互换,汉字转拼音,模糊搜索等功能。)

Java 5,569 748 Updated Sep 5, 2025

Streamlit app for chatting with Meta Llama 3.2 using Ollama and LangChain

Python 8 6 Updated Oct 3, 2024

a curated list of the role of small models in the LLM era

Python 111 4 Updated Sep 23, 2024

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 52,358 5,608 Updated Dec 19, 2025

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 36,834 6,091 Updated Nov 10, 2025

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Python 15,415 2,265 Updated Aug 15, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

21,930 2,084 Updated May 19, 2025
Jupyter Notebook 3 Updated May 19, 2023

A pytorch adversarial library for attack and defense methods on images and graphs

Python 1,075 190 Updated Jun 26, 2025

Model interpretability and understanding for PyTorch

Python 5,497 548 Updated Dec 20, 2025

A Python package to assess and improve fairness of machine learning models.

Python 2,176 478 Updated Dec 19, 2025

This repository introduces different Explainable AI approaches and demonstrates how they can be implemented with PyTorch and torchvision. Used approaches are Class Activation Mappings, LIMA and SHa…

Jupyter Notebook 30 5 Updated Jul 1, 2022

A game theoretic approach to explain the output of any machine learning model.

Jupyter Notebook 24,843 3,463 Updated Dec 11, 2025

The project page of paper: Trusted Multi-View Classification [ICLR'2021 paper]

Python 274 48 Updated Sep 24, 2024

Reliable Conflictive Multi-view Learning

Python 93 8 Updated Mar 24, 2024

Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]

Python 267 12 Updated Jul 28, 2025

Python-based Comprehensive Network Packet Analysis Library

Python 258 31 Updated Dec 20, 2025
Next