Stars
Awesome Large Language Models for Vulnerability Detection
Program analysis tools built on tree-sitter (https://github.com/tree-sitter/tree-sitter).
LLMDFA: Analyzing Dataflow in Code with Large Language Models (NeurIPS 2024)
A neurosymbolic framework for vulnerability detection in code
A deep learning model for localizing bugs in C/C++ source code (USENIX'23)
Data creation, training and eval scripts for the IRCoder paper
An implementation of the ACL 2024 Findings paper "Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning".
A Transformer-based Line-Level Vulnerability Prediction
A curated list of amazingly awesome Cybersecurity datasets
☠️ Ground-truth dataset for vulnerability prediction (known research datasets and data sources included such as NVD, CVE Details and OSV); tools to automatically update the data are provided.
The CVE Binary Tool helps you determine if your system includes known vulnerabilities. You can scan binaries for over 350 common, vulnerable components (openssl, libpng, libxml2, expat and others),…
[EMSE'26] An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding
Reverse Engineering: Decompiling Binary Code with Large Language Models
The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"
CleanVul: Automatic Function-Level Vulnerability Detection in Code Commits Using LLM Heuristics
vinzcamp8 / vul-LMGGNN
Forked from Vul-LMGNN/vul-LMGGNNCode for the paper - Source Code Vulnerability Detection: Combining Code Language Models and Code Property Graph
Code for the paper - Source Code Vulnerability Detection: Combining Code Language Models and Code Property Graph
DeepWukong: Statically Detecting Software Vulnerabilities Using Deep Graph Neural Network
Open source vulnerability DB and triage service.
SecVulEval is a dataset of C/C++ vulnerabilities.
MegaVul - The largest, high-quality, extensible, continuously updated, C/C++/Java vulnerability dataset
Repository for PrimeVul Vulnerability Detection Dataset
DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection (RAID 2023) https://surrealyz.github.io/files/pubs/raid23-diversevul.pdf