scalatest

Here are 176 public repositories matching this topic...

This project implements a distributed pipeline for NLP model training using Apache Spark and DeepLearning4J (DL4J). It uses a sliding-window approach for data preparation, positional embeddings for token encoding, and Word2Vec model training with parallel processing. The model and training process are designed for scalability and op

  • Updated Aug 26, 2025
  • Scala
