The Task Parallel System Composer (TaPaSCo)
-
Updated
Feb 6, 2026 - Verilog
The Task Parallel System Composer (TaPaSCo)
XCrypto: a cryptographic ISE for RISC-V
Hardware accelerator for convolutional neural networks
[ASAP 2020; FPGA 2020] Hardware architecture to accelerate GNNs (common IP modules for minibatch training and full batch inference)
This project is being developed as part of a Master's degree research sponsored by Brazil's CNPQ. It's goal is to design a hardware architecture to accelerate the AV1 arithmetic encoder.
Real-Time Hardware Sorter, Using A Multi-Dimensional Sorting Algorithm
This repository contains the design and implementation of a Spiking Neural Network (SNN) Processor. Spiking Neural Networks are a biologically-inspired class of artificial neural networks, where neurons communicate by sending discrete spikes.
grayscale conversion system and simple convolution system
Welcome to the repository for **Exercise 3 of the AI Systems Course** at the **University of Tehran**. This project focuses on designing and implementing a lightweight, efficient **processing element (PE)** for performing operations of neurons in a **multi-layer perceptron (MLP)** using Verilog.
Gravitational simulation of the N-body problem using FPGA hardware acceleration
MFCC Core written in SystemVerilog
It is the CORDIC implementation in Verilog. CORDIC is a hardware algorithm (CORDIC 硬體演算法) based on simple additions/subtractions and bit shifts. It can compute trigonometric, hyperbolic, exponentials, logarithms, multiplication, division, square, and square-root. And it is widely used in digital signal processing (DSP), embedded systems and FPGAs.
sequence detector with overlapped 2 patterns 010111 or 1101
A minimal FPGA accelerator kernel for hyperdimensional computing (HDC) inference
Hardware recreation of Atari Breakout on an FPGA using SystemVerilog and VGA graphics. Includes FSMs, real-time collision logic, and memory-mapped display.
High-performance FAST Corner Detector on PYNQ with using Hardware Acceleration (80+ FPS).
The IPPro is a 16-bit signed fixed-point, five-stage balanced pipelined RISC architecture that exploits the DSP48E1 features and provides balance among performance, latency and efficient resource utilization.
Specialized FPU for Fast Inverse Square Root Algorithm
Add a description, image, and links to the hardware-acceleration topic page so that developers can more easily learn about it.
To associate your repository with the hardware-acceleration topic, visit your repo's landing page and select "manage topics."