Skip to content
@amplec

AMPLEC

Automated Malware Processing with Language Explanation for Consumers

AMPLEC: Automated Malware Processing with Language Explanation for Consumers

amplec_logo


ATTENTION: THIS PROJECT IS UNDER DEVELOPMENT/WILL BE DEVELOPED IN THE FUTURE


AMPLEC (Automated Malware-analysis Processing with Language Explanation for Consumers) is a software project designed to create a system that uses a Large Language Model (LLM) to interpret and explain the results of an automated malware analysis pipeline, known as Karton. The system aims to simplify the complex data generated by malware analysis, providing clear and concise explanations in natural language for security analysts.

Features of AMPLEC

  1. Automated Interpretation:

    • Uses LLM to interpret and explain malware analysis results in natural language.
  2. User Interaction via Prompts:

    • Provides predefined prompts for users to select relevant interpretations quickly.
  3. Dynamic Data Handling:

    • Manages evolving data from malware analysis, adapting to new threats and pipeline changes.
  4. System Integration:

    • Integrates with existing systems via APIs using Python and Flask.
  5. Optional Advanced Features:

    • Retrieval Augmented Generation (RAG): Adds context from external data sources.
    • Function Calling: Allows the LLM to trigger further analysis tasks.
    • Open Prompting: Users can create custom prompts for more flexibility.

Implementation Overview

  • Architecture:

    • Three main components: the Karton malware analysis system, the LLM, and a web interface/API.
    • The LLM interprets analysis results and communicates with the web interface.
  • System Connections:

    • Uses APIs for component communication, implemented in Python.
  • LLM Implementation:

    • Selects and configures a local LLM model.
    • Utilizes Langchain for integrating actions and potential RAG extension.

Evaluation Plan

  • The system's effectiveness will be tested with known malware samples and compared with existing tools and human analysts to identify strengths and areas for improvement.

Popular repositories Loading

  1. .github .github Public

  2. main main Public

    This repo will contain documentation and a script or runner to start all the other repositorie's containers.

  3. ui ui Public

    This repo will contain a container, which will spin up a UI and maybe also host the api.

    Python

  4. core core Public

    This is the core (backend) repo of this project, in here the llm API will be used, aswell as the preprocessing done.

    Python

  5. utils utils Public

    This repo is a pip packet, built to house utility classes and functions that for example provide logging.

    Python

Repositories

Showing 5 of 5 repositories

Top languages

Loading…

Most used topics

Loading…