CAPSTONE PROJECT 2
DOMAIN: Semiconductor manufacturing process
• CONTEXT: A complex modern semiconductor manufacturing process is normally
under constant surveillance via the monitoring of signal variables collected from
sensors and/or process measurement points. However, not all of these signals are
equally valuable in a specific monitoring system. The measured signals contain a
combination of useful information, irrelevant information, and noise. Engineers
typically have many more signals than they require. If each type of signal is
treated as a feature, feature selection can be applied to identify the most
relevant signals. Process engineers can then use these signals to determine the key
factors contributing to yield excursions downstream in the process, enabling
increased process throughput, decreased time to learning, and reduced per-unit
production costs. The signals can be used as features to predict the yield type, and by
analysing and trying out different combinations of features, the essential signals that
affect the yield type can be identified.
• DATA DESCRIPTION: sensor-data.csv : (1567, 592)
The data consists of 1567 examples, each with 591 features; the remaining column is
the target. The dataset presented in this case represents a selection of such features,
where each example corresponds to a single production entity with its associated
measured features, and the labels represent a simple pass/fail yield from in-house
line testing: in the target column, "-1" corresponds to a pass and "1" corresponds to
a fail, and the timestamp in the data is for that specific test point.
• PROJECT OBJECTIVE: We will build a classifier to predict the Pass/Fail yield of a
particular process entity and analyse whether all of the features are required to build
the model.
Steps and tasks:
1. Import and explore the data.
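A minimal sketch of this step, assuming the CSV file sits in the working directory; all calls are standard pandas:

```python
import pandas as pd

# Load the dataset named in the data description above.
df = pd.read_csv("sensor-data.csv")

print(df.shape)          # expected: (1567, 592)
print(df.head())         # first look at the raw values
df.info()                # dtypes and non-null counts per column
print(df.describe().T)   # per-feature summary statistics

# Columns with the most missing values, to inform step 2.
print(df.isna().sum().sort_values(ascending=False).head(20))
```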
2. Data cleansing:
• Missing value treatment.
• Drop attributes, if required, using relevant functional knowledge.
• Make all other relevant modifications to the data using functional/logical
reasoning, and state your assumptions (see the sketch after this list).
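A minimal sketch of one possible cleansing pass, continuing from the DataFrame `df` of step 1. The 40% missing-value threshold, the mean imputation, and the "Time" column name are illustrative assumptions, not requirements of the brief:

```python
# Drop a timestamp-like column if present: it identifies the test point
# but is not a sensor signal ("Time" is an assumed name; check df.columns).
df = df.drop(columns=["Time"], errors="ignore")

# Drop columns with more than 40% missing values (assumed threshold).
df = df.loc[:, df.isna().mean() <= 0.40]

# Drop constant columns: zero variance carries no information for a classifier.
df = df.loc[:, df.nunique(dropna=True) > 1]

# Impute the remaining gaps with the column mean (median is a robust alternative).
df = df.fillna(df.mean(numeric_only=True))

print(df.shape)   # same number of rows, fewer columns
```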
3. Data analysis & visualisation:
• Perform a detailed, relevant statistical analysis of the data.
• Perform detailed univariate, bivariate and multivariate analyses, with appropriate
comments after each analysis (one example of each is sketched below).
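A minimal sketch showing one example of each level of analysis, assuming the cleansed `df` from step 2 and a target column named "Pass/Fail" (the name is an assumption; check df.columns):

```python
import matplotlib.pyplot as plt
import seaborn as sns

target = "Pass/Fail"                  # assumed target column name
features = df.drop(columns=[target])

# Univariate: distribution of a single sensor signal.
features.iloc[:, 0].hist(bins=30)
plt.title(f"Distribution of {features.columns[0]}")
plt.show()

# Bivariate: how one signal varies across the two yield classes.
sns.boxplot(x=df[target], y=features.iloc[:, 0])
plt.show()

# Multivariate: correlation structure of the first 20 signals.
sns.heatmap(features.iloc[:, :20].corr(), cmap="coolwarm", center=0)
plt.show()

# Class proportions: this check feeds directly into step 4.
print(df[target].value_counts(normalize=True))
```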
4. Data pre-processing:
• Segregate the predictors from the target attribute.
• Check the target balance and fix it if imbalanced (read up on SMOTE; see the
sketch after this list).
• Perform a train-test split and standardise the data, or vice versa if required.
• Check whether the train and test data have statistical characteristics similar to
those of the original data.
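A minimal sketch of this step, continuing from the cleansed `df` and the assumed "Pass/Fail" target name. SMOTE is applied to the training split only, after scaling, so no synthetic samples leak into the test set:

```python
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from imblearn.over_sampling import SMOTE

# Segregate predictors and target.
X = df.drop(columns=["Pass/Fail"])
y = df["Pass/Fail"]

# Stratify so both splits preserve the original pass/fail ratio.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42, stratify=y
)

# Fit the scaler on the training data only, then apply it to both splits.
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

# Oversample the minority (fail) class in the training data only.
X_train_bal, y_train_bal = SMOTE(random_state=42).fit_resample(X_train, y_train)

print(pd.Series(y_train_bal).value_counts())  # classes should now be balanced
```

One way to verify the last sub-task is to compare `y_train.value_counts(normalize=True)` and `y_test.value_counts(normalize=True)` against `y.value_counts(normalize=True)`, and to run `describe()` on each split before scaling.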
5. Model training, testing and tuning:
• Model training:
- Pick a supervised learning model.
- Train the model.
- Use cross-validation techniques.
- Apply GridSearch hyper-parameter tuning to get the best accuracy (see the
combined sketch after this step). Suggestion: use all possible hyper-parameter
combinations to extract the best accuracies.
- Use any other technique/method that can enhance model performance.
Hint: dimensionality reduction, attribute removal, standardisation/normalisation,
target balancing, etc.
- Display and explain the classification report in detail.
- Apply the above steps to at least three different kinds of models, covering both
models you have learnt so far and models you have not yet covered (Random Forest,
SVM, Naive Bayes, etc.).
• Display and compare all the models designed, with their train and test accuracies.
• Select the final, best-trained model, along with your detailed comments justifying
the selection.
• Save the selected model for future use.
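A combined sketch of this step under the same assumptions as step 4 (`X_train_bal`, `y_train_bal`, `X_test`, `y_test` already exist). The three models and the small grids shown are illustrative; the brief asks for richer hyper-parameter combinations:

```python
import joblib
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import GridSearchCV
from sklearn.metrics import classification_report

# Candidate models with illustrative (deliberately small) parameter grids.
candidates = {
    "RandomForest": (RandomForestClassifier(random_state=42),
                     {"n_estimators": [100, 300], "max_depth": [None, 10]}),
    "SVM": (SVC(random_state=42),
            {"C": [0.1, 1, 10], "kernel": ["rbf", "linear"]}),
    "NaiveBayes": (GaussianNB(), {}),   # GaussianNB has little to tune
}

results, fitted = [], {}
for name, (model, grid) in candidates.items():
    # 5-fold cross-validated grid search on the balanced training set.
    search = GridSearchCV(model, grid, cv=5, scoring="accuracy", n_jobs=-1)
    search.fit(X_train_bal, y_train_bal)
    best = search.best_estimator_
    fitted[name] = best
    results.append({
        "model": name,
        "best_params": search.best_params_,
        "train_accuracy": best.score(X_train_bal, y_train_bal),
        "test_accuracy": best.score(X_test, y_test),
    })
    # Per-class precision, recall and F1 on the held-out test split.
    print(name)
    print(classification_report(y_test, best.predict(X_test)))

# Side-by-side comparison of train vs test accuracies for all models.
comparison = pd.DataFrame(results).set_index("model")
print(comparison)

# Save the best model (here judged by test accuracy) for future use.
best_name = comparison["test_accuracy"].idxmax()
joblib.dump(fitted[best_name], "best_model.joblib")
```

Picking purely by test accuracy is a simplification: on an imbalanced problem like this, the classification report's recall and F1 for the minority (fail) class deserve at least as much weight in the final selection.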
6. Conclusion and improvement:
• Write your conclusions on the results and suggest possible improvements.