Skip to content
#

pig

Here are 33 public repositories matching this topic...

Working on a batch analytics pipeline using Hortonworks HDP 2.6.5. Include loading data into HDFS, creating schemas, using Pig and Hive for transformations, running a MapReduce job, and building PySpark models for clustering, classification, and regression. NLP and sentiment analysis, reduce features using PCA or SVD, and graph analysis applied.

  • Updated Dec 31, 2025
  • Python

Improve this page

Add a description, image, and links to the pig topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pig topic, visit your repo's landing page and select "manage topics."

Learn more