Open Source Data Management Systems - Page 2

  • Level Up Your Cyber Defense with External Threat Management Icon
    Level Up Your Cyber Defense with External Threat Management

    See every risk before it hits. From exposed data to dark web chatter. All in one unified view.

    Move beyond alerts. Gain full visibility, context, and control over your external attack surface to stay ahead of every threat.
    Try for Free
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • 1
    Broot

    Broot

    A new way to see and navigate directory trees

    Get an overview of a directory, even a big one. That's what makes it usable where the old tree command would produce pages of output. Hit alt/enter and you're back to the terminal in the desired location. This way, you can navigate to a directory with the minimum amount of keystrokes, even if you don't exactly remember where it is. Broot is fast and doesn't block (any keystroke interrupts the current search to start the next one). Never lose track of file hierarchy while you search. Broot tries to select the most relevant file. You can still go from one match to another one using tab or arrow keys. You may also search with a regular expression. To do this, add a / before the pattern. You may also apply logical operators or combine patterns, for example searching test in all files except json ones could be !/json$/&c/test and searching carg both in file names and file contents would be carg|c/carg.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Cytoscape.js

    Cytoscape.js

    Graph theory library for visualization and analysis

    A fully featured graph library written in pure JS. Permissive open source license (MIT) for the core Cytoscape.js library and all first-party extensions. Used in commercial projects and open-source projects in production. Designed for users first, for both frontfacing app usecases and developer usecases. Highly optimized. Compatible with All modern browsers. Legacy browsers with ES5 and canvas support. ES5 and canvas support are required, and feature detection is used for optional performance enhancements. Browsers circa 2012 support ES5 fully: IE10, Chrome 23, Firefox 21, Safari 6 (caniuse). Browsers with partial but sufficient ES5 support also work, such as IE9 and Firefox 4. The documentation and examples are not optimized for old browsers, although the library itself is. Some demos may not work in old browsers in order to keep the demo code simple.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Elementary

    Elementary

    Open-source data observability for analytics engineers

    Elementary is an open-source data observability solution for data & analytics engineers. Monitor your dbt project and data in minutes, and be the first to know of data issues. Gain immediate visibility, detect data issues, send actionable alerts, and understand the impact and root cause. Generate a data observability report, host it or share with your team. Monitoring of data quality metrics, freshness, volume and schema changes, including anomaly detection. Elementary data monitors are configured and executed like native tests in dbt your project. Uploading and modeling of dbt artifacts, run and test results to tables as part of your runs. Get informative notifications on data issues, schema changes, models and tests failures. Inspect upstream and downstream dependencies to understand impact and root cause of data issues.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Fluid

    Fluid

    Fluid, elastic data abstraction and acceleration for BigData/AI apps

    Fluid, elastic data abstraction and acceleration for BigData/AI applications in the cloud. Provide DataSet abstraction for underlying heterogeneous data sources with multidimensional management in a cloud environment. Enable dataset warmup and acceleration for data-intensive applications by using a distributed cache in Kubernetes with observability, portability, and scalability. Taking characteristics of application and data into consideration for cloud application/dataset scheduling to improve the performance.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Simple, Secure Domain Registration Icon
    Simple, Secure Domain Registration

    Get your domain at wholesale price. Cloudflare offers simple, secure registration with no markups, plus free DNS, CDN, and SSL integration.

    Register or renew your domain and pay only what we pay. No markups, hidden fees, or surprise add-ons. Choose from over 400 TLDs (.com, .ai, .dev). Every domain is integrated with Cloudflare's industry-leading DNS, CDN, and free SSL to make your site faster and more secure. Simple, secure, at-cost domain registration.
    Sign up for free
  • 5
    IoTDB

    IoTDB

    Apache IoTDB

    Apache IoTDB (Database for Internet of Things) is an IoT native database with high performance for data management and analysis, deployable on the edge and the cloud. Due to its light-weight architecture, high performance and rich feature set together with its deep integration with Apache Hadoop, Spark and Flink, Apache IoTDB can meet the requirements of massive data storage, high-speed data ingestion and complex data analysis in the IoT industrial fields. In the scene of factories, there are tens of devices under LAN network. IoTDB can be installed on a local controller server in the factory to receive data from those devices. The local controller server (normal PC or workstation) with IoTDB can provide the ability to persist data and query data with SQL-like interface. In addition, with TsFile-Sync tool, TsFiles on the local controller can be transmitted to the data center equipped with IoTDB instance in the cloud.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    ODD Platform

    ODD Platform

    First open-source data discovery and observability platform

    Unlock the power of big data with OpenDataDiscovery Platform. Experience seamless end-to-end insights, powered by unprecedented observability and trust - from ingestion to production - while building your ideal tech stack! Democratize data and accelerate insights. Find data that fits your use case and discover hints left by your peers to leverage existing knowledge. Explore tags, ownership details, links to other sources and other information to shorten and simplify data discovery phase. Forget unnerved stakeholders and wasting too much time on digging the root cause of data issues when it fails. With ODD’s automatic company-wide ingestion-to-product lineage you’ll have answers in just seconds and stakeholders won’t need to wait. Sleep well, knowing all your data is in check. Forget manual testing, days of debugging, and weeks of worrying. Know the impact of each code change with automatic testing. Enjoy lineage and alerts powered with data quality information.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    xsv

    xsv

    A fast CSV command line toolkit written in Rust

    xsv is a command line program for indexing, slicing, analyzing, splitting and joining CSV files. Commands should be simple, fast and composable. Simple tasks should be easy. Performance trade offs should be exposed in the CLI interface. Composition should not come at the expense of performance. Let's say you're playing with some of the data from the Data Science Toolkit, which contains several CSV files. Maybe you're interested in the population counts of each city in the world. So grab the data and start examining it. The next thing you might want to do is get an overview of the kind of data that appears in each column. The stats command will do this for you. The xsv table command takes any CSV data and formats it into aligned columns using elastic tabstops. These commands are instantaneous because they run in time and memory proportional to the size of the slice (which means they will scale to arbitrarily large CSV data).
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    fsv, the 3D File System Visualizer, allows you to view a collection of directories and files as three-dimensional geometry. It represents all or part of a filesystem as a collection of blocks of varying sizes, each labeled with a filename, and arranged in
    Leader badge
    Downloads: 22 This Week
    Last Update:
    See Project
  • 9
    AWA-Core

    AWA-Core

    Full application for factory, process engineer and Automation..

    NEW -- NEW -- NEW -- NEW -- NEW AWA-Core 2025 is coming with a totally new architecture. The core is now in Client/Server architecture and open to other applications. New interfaces for the server and client sides. Stay tuned !! AWA-Core (Another Way of Automation) is a complete suite that allows engineers, PLC programmers and factory designers to create huge projects for retrieving data, creating graphics, automatic scripts, exports and data links. You can easily manage AWA-Core and it's easier than Historian softwares.
    Downloads: 16 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    UPDATE: Chromium is no longer updated or maintained. The project is frozen. Chromium is a flexible framework for scalable real-time rendering on clusters of workstations, derived from the Stanford WireGL project code base.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    HPCC Systems

    HPCC Systems

    End-to-end big data in a massively scalable supercomputing platform.

    HPCC Systems® (www.hpccsystems.com) from LexisNexis® Risk Solutions is a proven, open source solution for Big Data insights that can be implemented by businesses of all sizes. With HPCC Systems, developers can design applications with Big Data at their core, enabling businesses to better analyze and understand data at scale, improving business time to results and decisions. HPCC Systems offers a consistent data-centric programming language, two processing platforms and a single, complete end-to-end architecture for efficient processing. Read our blog (http://hpccsystems.com/blog ), or connect with us on Twitter (@hpccsystems), Facebook (https://www.facebook.com/hpccsystems ) and LinkedIn (http://www.linkedin.com/company/hpcc-systems) HPCC Systems is available on AWS & can be configured through the Instant Cloud Solution.
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 12
    CanFestival focuses on providing an ANSI-C platform independent CANOpen stack that can be implemented as master or slave nodes on PCs, Real-time IPCs, and Microcontrollers. A WxPython based CanOpen Object Dictionary editor GUI is also provided.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 13
    Equalizer - Parallel Rendering
    Equalizer is the standard middleware to create parallel OpenGL-based applications. Please visit https://github.com/Eyescale for current development information.
    Leader badge
    Downloads: 11 This Week
    Last Update:
    See Project
  • 14
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. DocWire SDK is dedicated to streamlining data processing, reducing development time and costs, and harnessing the potential of AI. Its advancements promise a superior experience compared to its predecessor, DocToText.
    Leader badge
    Downloads: 9 This Week
    Last Update:
    See Project
  • 15
    Ubix Linux

    Ubix Linux

    The Pocket Datalab

    Ubix stands for Universal Business Intelligence Computing System. Ubix Linux is an open-source, Debian-based Linux distribution geared towards data acquisition, transformation, analysis and presentation. Ubix Linux purpose is to offer a tiny but versatile datalab. Ubix Linux is easily accessible, resource-efficient and completely portable on a simple USB key. Ubix Linux is a perfect toolset for learning data analysis and artificial intelligence basics on small to medium datasets. You can find additional information, technical guidance, and user credentials on the project website https://ubix-linux.sourceforge.io/ or on the project subreddit https://reddit.com/r/UbixLinux.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 16
    Repository for open source software re-packaged into MSI packages for enterprise deployment.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 17
    Bootchart is a tool for performance analysis and visualization of the GNU/Linux boot process. Resource utilization data and process information are collected during the boot process and can later be displayed in a PNG, SVG or EPS chart.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18
    DatacenterManager

    DatacenterManager

    UNIX Performance Monitoring / Trend Analysis Java Software

    Remotely Inventory and Poll UNIX servers in seconds. (without installing extra software on your servers, just by SSH communication plain old UNIX commands).https://sites.google.com/site/ronuitzaandam/ Your entire datacenter can be automatically inventoried by supplying hostname, username & password for each server, either “one by one” or via an automated CSV host-list import file. This software goes great with other UNIX software like WinSCP and Putty etc !!!
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    Portable Linux

    Portable Linux

    Portable Ubuntu Linux for Scientific Computing

    Released August 22, 2013 Lubuntu Blends: Biochemistry 13.04 (Raring) v5.44 Linux Kernel Image 3.8.0-29 Lubuntu Blends are pre-installed Wubi disk image remixes of Ubuntu and Debian Science meta packages, A custom boot loader allows installations to be copied and automatically booted from most external or USB flash drives. Once up and running, use earlier Lubuntu Remix README instructions here until documentation is updated. https://sourceforge.net/projects/portable-linux/files/ Installation 1. Download the Wubi installer http://releases.ubuntu.com/saucy/wubi.exe 2. Install any flavor of Ubuntu. 3. Swap out the root.disk with the ones provided here. Overview LAMP stack running on localhost (127.0.0.1). Scientific, productivity & media packages include R (Rattle Data Miner), GridEngine, Condor, cooperative computing tools, WINE, LibreOffice, Evolution, Clinica, Neuro Debian Desktop, PsycoPy, OpenVibe, 3DSlicer, Paraview, Openshot. Cheers, Gregory Remington
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    A base for programs. Includes algorythms for Q-learning and SOM's etc. too. Examples: Hamron: Simulates evolution, uses the 2D-renderer. DriveUnit: created for school, for a robotic arm, uses the 3D-renderer. Hlearn: http://www.sagenb.org/home/pub/8
    Downloads: 5 This Week
    Last Update:
    See Project
  • 21
    Talend Spatial Module (aka Spatial Data Integrator or SDI) is an ETL tool for geospatial. Based on Talend Open Studio, input, output and transform geocomponents are available. IO components read/write GIS formats(eg.PostGIS, GeoRSS). Transformers all
    Downloads: 5 This Week
    Last Update:
    See Project
  • 22
    tnv
    TNV visualizes pcap data to graphically depict network packets, links, and ports for network traffic analysis to facilitate learning what constitutes 'normal' behavior, investigating security events, or network troubleshooting.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23

    ELock

    Prevent unauthorised access to operating system

    The ELock program allows administrator to block access to operating system on Windows CE5.0, CE6.0 devices. It allows to control running programs. The functionality is similar to AppCenter (which works only on Motorola devices). Works on Motorola, Casio, Datalogic and others devices. Program ELock pozwala administratorowi zablokować dostęp użytkownika do opcji systemu na urządzeniach z systemem Windows CE5.0, CE6.0. Umożliwia kontrolę uruchamianych programów. Jest programem podobnym do AppCenter (pracującym jedynie na terminalach Motoroli). Działa między innymi na terminalach Motoroli, Casio, Datalogic.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    Streams for IBM i

    Streams for IBM i

    Batch performance boosting and Big Data framework for IBM i

    Streamd for IBM i is a suite of tools for IBM i (previously known as AS/400 and iSeries) that can significantly improve performance characteristics of batch processes. Due to extensive use of parallel programming techniques Streams for IBM i delivers significant performance improvements for single streamed batch jobs. Streams for IBM i can split an existing batch process into a number of concurrent streams, completely eliminate backup-related delays, introduce new robust recovery policies and even modify the program logic of existing applications - all without any code modifications. Streams for IBM i includes a feature allowing manipulations of batch job QTEMP libraries.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    Netsim is a mobile ad hoc network simulator targeted at large heterogeneous node configurations. It is written in Java and is easily extensible through its modular concept.
    Downloads: 4 This Week
    Last Update:
    See Project
Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.