Data-Driven Soccer Scouting

This document outlines a project to use data mining techniques on FIFA 19 player data to help soccer clubs scout for players. The goals are to identify undervalued players, analyze current rosters for over/underperformers, build a similarity database for player comparisons, and develop predictive models for future player potential/value. The team will cluster, analyze, and build models on the FIFA 19 dataset containing attributes for over 18,000 real players to achieve these objectives.

Uploaded by

Mauricio Peñaloza

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

231 views3 pages

Data-Driven Soccer Scouting

Uploaded by

Mauricio Peñaloza

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Scouting Players with FIFA19

Applying Data Mining to Scouting

Data Driven Approach to Scouting

In the era of eight-figure salaries and nine figure signing fees, player recruitment is a
high-stakes game. In the past, soccer scouts have relied on rudimentary data and intuition to
evaluate the performance and value of soccer players. With the recent rise in data analytics
that can capture many aspects of a player’s performance, statistics and data science are
beginning to play a more prominent role in identifying rising stars and overvalued /
undervalued players.

For this project, we are positioning ourselves as a scouting agency that uses analytics to,
among other things, enhance the discovery of talents and help soccer clubs better understand
the dynamics (features) that come into play when determining the value, overall and future
potential of a player. Our agency will be focusing on solving these fundamental scouting
problems:

1. Finding undervalued players for a given club to acquire,

2. Analyzing a team’s current roster for over-payed and/or underperforming
players that could be traded or sold,
3. Developing a database of similar players for clubs looking for a specific player
type,
4. Build a predictive model to evaluate the future potential of young players.

We will be utilizing the FIFA 19 Player dataset available on Kaggle and apply various Data
Mining techniques to achieve our objectives.

Project Objectives
• Cluster players based various features to identify different player types for our similarity
database.
• Identify under-valued and over-valued players based on ability measures relative to
their value, salary, and/or release clause.
• Building predictive models for future value and potential of players.

Dataset
• Source: Kaggle
• Description: Detailed attributes for every player registered in the latest edition of FIFA
2019 database.
• Size: 9.1MB (18.2k observations x 89 features)
• Features:

1
• ID • Value • Joined
• Name • Wage • Loaned From
• Age • Special • Contract Valid Until
• Photo • Preferred Foot • Height
• Nationality • International Reputation • Weight
• Overall • Weak Foot • Ability by positions (26 features)
• Potential • Skill Moves • Ability by skills (34 features)
• Club • Work Rate • Release Clause
• Position • Jersey Number

Team & Roles

• Markus Wehr: Finding undervalued players.
• Nazih Kalo: Analyzing current roster of players.
• Stephen Stark: Developing similarity database.
• Tam Nguyen: Predictive model for future potential/value.
• Woo Jong Choi: Predictive model for future potential/value.

Data Mining Steps:

• Missing value, data type
Data pre-
• Features distribution
processing
• Feature engineering
1. Pre-processing and EDA
2. Clustering
Analysis
3. Build predictive models
Stages
4. Analyze performance & make final predictions
5. Visualize Output
• PCA
• t-SNE
• K-means
• DBSCAN
• SVD
• Regression: linear/ logit
Potential
• Hierarchical Clustering
Methods
• Latent Class Clustering
• Discriminant Analysis
• Regression Trees
• Random forest
• Decision trees
• Association rules
1. Microsoft Teams
Tools 2. Python
− Jupyter Notebook, Google Collab

2
− Pandas, Numpy, Matplotlib, Seaborn, Scikit-learn, Scipy
3. Tableau

Applying Data Mining To Scouting: Markus Wehr Nazih Kalo Stephen Stark Tam Nguyen Woojong Choi
No ratings yet
Applying Data Mining To Scouting: Markus Wehr Nazih Kalo Stephen Stark Tam Nguyen Woojong Choi
38 pages
Rajesh 2020
No ratings yet
Rajesh 2020
9 pages
Data Driven Football Scouting Assistance With Simulated Player Performance Extrapolation
No ratings yet
Data Driven Football Scouting Assistance With Simulated Player Performance Extrapolation
8 pages
Application of Different Model Algorithm in The Prediction of Transfer Fee of Soccer Players
No ratings yet
Application of Different Model Algorithm in The Prediction of Transfer Fee of Soccer Players
9 pages
A Survey On Football Player Performance and Value Estimation Using Machine Learning Techniques (#1215552) - 2816789
No ratings yet
A Survey On Football Player Performance and Value Estimation Using Machine Learning Techniques (#1215552) - 2816789
6 pages
Football Data Insights for Clubs
No ratings yet
Football Data Insights for Clubs
27 pages
Problem Statement - PBI - Docx-1
No ratings yet
Problem Statement - PBI - Docx-1
1 page
Playerank: Data-Driven Performance Evaluation and Player Ranking in Soccer Via A Machine Learning Approach
No ratings yet
Playerank: Data-Driven Performance Evaluation and Player Ranking in Soccer Via A Machine Learning Approach
27 pages
Developing A Reliable Hybrid Machine Learning Model For Objective Soccer Player Valuation
No ratings yet
Developing A Reliable Hybrid Machine Learning Model For Objective Soccer Player Valuation
13 pages
DS Project Docs
No ratings yet
DS Project Docs
17 pages
Transfer Portal Accurately Forecasting The Impact of A 2201.11533
No ratings yet
Transfer Portal Accurately Forecasting The Impact of A 2201.11533
25 pages
Player Stats Analysis Using Machine Learning
No ratings yet
Player Stats Analysis Using Machine Learning
4 pages
Predictthe Valueof Football Players Using FIFAvideogamedataand Machine Learning Techniques
No ratings yet
Predictthe Valueof Football Players Using FIFAvideogamedataand Machine Learning Techniques
16 pages
Money Ball
No ratings yet
Money Ball
8 pages
Problem Statement - PBI
No ratings yet
Problem Statement - PBI
1 page
Problem Statement - FIFA
No ratings yet
Problem Statement - FIFA
1 page
Predicting Football Transfer Values
No ratings yet
Predicting Football Transfer Values
6 pages
2212.11041-What Should Clubs Monitor To Predict Future Value of Footbal Players
No ratings yet
2212.11041-What Should Clubs Monitor To Predict Future Value of Footbal Players
22 pages
Business Analytics in Sport Talent Acquisition Met
No ratings yet
Business Analytics in Sport Talent Acquisition Met
20 pages
FIFA 18 - Data Analysis: - Harsh Takrani - Pranay Lulla
No ratings yet
FIFA 18 - Data Analysis: - Harsh Takrani - Pranay Lulla
16 pages
Predict The Value of Football Players Using FIFA Video Game Data and Machine Learning Techniques
No ratings yet
Predict The Value of Football Players Using FIFA Video Game Data and Machine Learning Techniques
15 pages
57 - Step PPT 2 Cpr3 Final
No ratings yet
57 - Step PPT 2 Cpr3 Final
32 pages
Football Player Performance Prediction
No ratings yet
Football Player Performance Prediction
6 pages
2018 - BARRON - Artificial Neural Networks and Player Recruitment in Professional Soccer
No ratings yet
2018 - BARRON - Artificial Neural Networks and Player Recruitment in Professional Soccer
11 pages
2020-21 Fall 41553 Bernardo-Pinto
No ratings yet
2020-21 Fall 41553 Bernardo-Pinto
49 pages
FIFA Report
No ratings yet
FIFA Report
10 pages
6 powerBI Project PDF
100% (3)
6 powerBI Project PDF
16 pages
DEM Project Report
No ratings yet
DEM Project Report
7 pages
Player Ank
No ratings yet
Player Ank
18 pages
PlayeRank: Data-Driven Performance Evaluation and Player Ranking in Soccer Via A Machine Learning Approach
No ratings yet
PlayeRank: Data-Driven Performance Evaluation and Player Ranking in Soccer Via A Machine Learning Approach
18 pages
Football Market Value Prediction
No ratings yet
Football Market Value Prediction
19 pages
Untitled Document 61
No ratings yet
Untitled Document 61
12 pages
INFO Assignment 1
No ratings yet
INFO Assignment 1
6 pages
Rating Prediction of Football Players Using Machine Learning
No ratings yet
Rating Prediction of Football Players Using Machine Learning
6 pages
FIFA Video Game - Players Classification
No ratings yet
FIFA Video Game - Players Classification
26 pages
Cap484 Final Project
No ratings yet
Cap484 Final Project
8 pages
Soccerment TheClusteringProject ENG 20220615 PDF
100% (1)
Soccerment TheClusteringProject ENG 20220615 PDF
158 pages
Handbook Fa
100% (1)
Handbook Fa
27 pages
Football Analytics for Clubs
No ratings yet
Football Analytics for Clubs
27 pages
Data Science Methodology in Football Players Recruitment
No ratings yet
Data Science Methodology in Football Players Recruitment
2 pages
Entropy 23 00090 v3
No ratings yet
Entropy 23 00090 v3
12 pages
Artificial Neural Networks and Player Recruitment in Professional Soccer
No ratings yet
Artificial Neural Networks and Player Recruitment in Professional Soccer
8 pages
Additional Project Problem Statement - FIFA Data Analysis
No ratings yet
Additional Project Problem Statement - FIFA Data Analysis
2 pages
Machine Learning-Driven Market Value Prediction For European Football Players
No ratings yet
Machine Learning-Driven Market Value Prediction For European Football Players
17 pages
Super Bowl
No ratings yet
Super Bowl
10 pages
Fantasy Sports Prediction Clustering Analysis
No ratings yet
Fantasy Sports Prediction Clustering Analysis
21 pages
Football Match Prediction Methods
No ratings yet
Football Match Prediction Methods
59 pages
Project - Management - PPT Final
No ratings yet
Project - Management - PPT Final
18 pages
Football Talent Scouting System
No ratings yet
Football Talent Scouting System
2 pages
Whitepaper The Soccer Analytics Revolution 1
No ratings yet
Whitepaper The Soccer Analytics Revolution 1
10 pages
NBA Data Analytics & Visualization
No ratings yet
NBA Data Analytics & Visualization
17 pages
Football Pass Valuation Models
No ratings yet
Football Pass Valuation Models
73 pages
Professional Scouting of Talents As The Inevitable Component of Modern Football
No ratings yet
Professional Scouting of Talents As The Inevitable Component of Modern Football
12 pages
CapstoneSynopsis A
No ratings yet
CapstoneSynopsis A
6 pages
Usage of Analytics in The World of Sports
No ratings yet
Usage of Analytics in The World of Sports
7 pages
Crafting A Player Impact Metric Through Analysis of Football Match Event Data
No ratings yet
Crafting A Player Impact Metric Through Analysis of Football Match Event Data
15 pages
ML in Soccer Analytics Gunjan Kumar
No ratings yet
ML in Soccer Analytics Gunjan Kumar
99 pages
Comprehensive Analysis of Football Player Market V
No ratings yet
Comprehensive Analysis of Football Player Market V
7 pages
Tor-V1 6
No ratings yet
Tor-V1 6
144 pages
AS Series Module Guide
No ratings yet
AS Series Module Guide
486 pages
SAFe LPM 100 MCQs
No ratings yet
SAFe LPM 100 MCQs
22 pages
D&D 5e Statblock Generator
No ratings yet
D&D 5e Statblock Generator
1 page
How To Make A Complete Map of Every Thought You Think
No ratings yet
How To Make A Complete Map of Every Thought You Think
61 pages
Syllabus 201320 78
No ratings yet
Syllabus 201320 78
5 pages
Lock 101
No ratings yet
Lock 101
4 pages
10 - Mini Projects
No ratings yet
10 - Mini Projects
20 pages
EE 332 Lab 3
No ratings yet
EE 332 Lab 3
10 pages
Kallam Haranadha Reddy Institute of Technology - Faculty
No ratings yet
Kallam Haranadha Reddy Institute of Technology - Faculty
70 pages
Nammcesa 000033 PDF
No ratings yet
Nammcesa 000033 PDF
2,112 pages
HP Color LaserJet 9500 9500 MFP Service Manual
No ratings yet
HP Color LaserJet 9500 9500 MFP Service Manual
574 pages
Random Quote Generator1
No ratings yet
Random Quote Generator1
11 pages
TIA EIA 568 B.2 1final
No ratings yet
TIA EIA 568 B.2 1final
86 pages
ICT Lab Activities Overview
No ratings yet
ICT Lab Activities Overview
13 pages
Community Needs Assessment Guide
100% (1)
Community Needs Assessment Guide
30 pages
PhonePe Statement Jun2025 Jun2025
No ratings yet
PhonePe Statement Jun2025 Jun2025
6 pages
Regular Expression, Rollover and Frames-1
No ratings yet
Regular Expression, Rollover and Frames-1
26 pages
Garmin Health Connect API Agreement
No ratings yet
Garmin Health Connect API Agreement
8 pages
Asynchronus Data-Link Protocols
100% (1)
Asynchronus Data-Link Protocols
15 pages
SITHIND002 (Assessment 1)
No ratings yet
SITHIND002 (Assessment 1)
20 pages
EPR Registration Process
No ratings yet
EPR Registration Process
3 pages
FUll List Test Bank and Solution Manual 2020-2021 (Student Saver Team) - Part 9
100% (1)
FUll List Test Bank and Solution Manual 2020-2021 (Student Saver Team) - Part 9
70 pages
Hytran Demo Install Instructions
No ratings yet
Hytran Demo Install Instructions
2 pages
Structed Query Language: Questions Answer
No ratings yet
Structed Query Language: Questions Answer
13 pages
Materi Data Visualisasi
No ratings yet
Materi Data Visualisasi
4 pages
Aromat 10-40 HMI Manual
No ratings yet
Aromat 10-40 HMI Manual
24 pages
Investor Presentation - December 2012
No ratings yet
Investor Presentation - December 2012
36 pages
AD 2020-10-05 Collins FMS
No ratings yet
AD 2020-10-05 Collins FMS
9 pages
Types of Electrical Wire
No ratings yet
Types of Electrical Wire
9 pages

Data-Driven Soccer Scouting

Uploaded by

Data-Driven Soccer Scouting

Uploaded by

Scouting Players with FIFA19

Applying Data Mining to Scouting

Data Driven Approach to Scouting

1. Finding undervalued players for a given club to acquire,

Team & Roles

Data Mining Steps:

You might also like