0% found this document useful (0 votes)
14 views9 pages

Problem Statement (Short) : What Exact Problem Is Being Addressed?

hijhhgugyuggyug

Uploaded by

joy.banerjee1908
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views9 pages

Problem Statement (Short) : What Exact Problem Is Being Addressed?

hijhhgugyuggyug

Uploaded by

joy.banerjee1908
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

✅ Problem Statement (short)

The PM Internship Scheme aims to place large numbers of students into internships, but
current manual/mismatch processes cause delays, low uptake, and inequitable placement. A
scalable, AI-driven matchmaking platform is needed to automatically match students to
internships using multi-criteria signals (skills, qualifications, location, sector interest,
affirmative-action priorities, past participation, and industry capacity), while remaining
transparent, auditable, and inclusive. (DD News, The Times of India)

1. Pain Points & Core Understanding 🔎


What exact problem is being addressed?

• Matching thousands–millions of applicants to internships is manual, slow, opaque,


and often unfair — resulting in unfilled roles, high dropout, and mismatches between
intern skills and job needs. (The Times of India)

Why does this problem exist (root causes)?

• Scale: candidate and vacancy volumes overwhelm manual matching.


• Heterogeneous criteria: skills, location, reservation/affirmative action priorities, past
participation, and capacity constraints create complex constraints.
• Data gaps & quality: inconsistent CV/profile details and non-standard vacancy
descriptions.
• Lack of automation & transparency: no single system to rank/filter candidates by
multi-criteria fairness.
• Low engagement & trust: students unsure about internship quality and stipend
clarity. (DD News, The Times of India)

Primary stakeholders / users

• Students (aspirants seeking internships)


• Industry partners (companies posting internships)
• Scheme administrators (government/placement cells)
• Colleges/universities (verification, onboarding)
• Regional nodal officers (monitor quotas, affirmative action)

Current challenges / inefficiencies

• Repeated manual verification, time-consuming sorting/filtering.


• Hard to enforce affirmative action or district/sector quotas fairly at scale.
• Unclear matching criteria leads to poor intern experience & low completion.
• Dropouts & underutilized positions (pilot reports show low conversion). (The Times
of India)

2. Feasibility of Execution ⚙️
Can a working prototype be built in a hackathon?

Yes — a functional prototype (candidate & employer dashboard, matching engine using off-
the-shelf ML/NLP, basic admin console, and notifications) is achievable in 48–72 hours with
mock data and focused scope.

Key technical requirements

• Backend: Python (Flask/FastAPI) + ML libraries (scikit-learn, TensorFlow/PyTorch


for advanced models).
• Matching models:
o Content-based (skills ↔ job requirements via embeddings / TF-IDF / BERT)
o Collaborative filtering / ranking model for preference signals
o Constraint solver for quotas/affirmative action (linear programming / matching
algorithm)
• NLP: Parse free-text job descriptions & CVs (spaCy / Transformers).
• DB: PostgreSQL / MongoDB for profiles, Redis for caching.
• APIs & Integrations: Google Maps (geolocation), SMS gateway (Twilio) /
WhatsApp for low-connectivity notifications.
• UI: React for web dashboards; optional Flutter for mobile prototype.
• Security: OAuth2, role-based access, encryption at rest/in transit.
• Hosting: Cloud (AWS/GCP/Azure) — serverless components to scale.
• Monitoring: Simple dashboards (Grafana / Cloudwatch) + logging. (Internshala)

Possible blockers

• Authoritative data / verification (academic credentials, past participation).


• Privacy & legal constraints — handling student personal data (consent, retention).
• Industry engagement — ensuring accurate vacancy metadata.
• Algorithmic fairness — implementing legally compliant affirmative-action logic.
(See research on India’s reservation complexities). (tayfunsonmez.net, arXiv)

Hackathon MVP to impress evaluators

• Candidate & employer sign-up + profile creation (structured skills, geo, categories).
• Vacancy creation UI with capacity, quotas, and preference fields.
• Matching engine that outputs ranked matches with a transparent "Match Score" and
constraint metadata (why matched, quota applied).
• Admin dashboard showing aggregate KPIs (placements, unfilled slots, demographic
breakdown).
• Notifications (email/SMS) pipeline for match offers and acceptance flows.

3. Impact & Relevance 🌍


Who benefits?

• Students: better, faster placements tailored to skills & constraints.


• Industry: higher quality candidate matches; fewer no-shows.
• Government: streamlined scheme administration, measurable KPIs.
• Colleges: improved placement records & reporting.

Real-world impact

• Economic: more employable youth; reduced friction/time-to-onboard.


• Social: inclusive opportunities for disadvantaged students via prioritized matching.
• Programmatic: increased utilization of vacancies and improved trust in the scheme.
(Pilot reports show dropouts — reducing that increases effectiveness). (The Times of
India)

Scalability

• Architecture can scale to state/national level using cloud auto-scaling and


batch/asynchronous matching pipelines.
• Policy logic (reservation, quotas) should be configurable per state/sector.

Why evaluators care

• Addresses a high-visibility national objective (large beneficiary counts).


• Demonstrates measurable KPIs (placements, fill-rate, fairness metrics).
• Feasible, replicable, and directly impactful.
4. Scope of Innovation (Existing Solutions) 💡
Existing products / platforms
Platform What it does well Limitations (relevant to PM Scheme)

Large marketplace for Marketplace focus — limited support for


Internshala
internships; intuitive UI; govt-level quotas/affirmative action & large
(India)
notifications. batch matching. (Internshala)

Large employer reach, Not optimized for internship quotas,


LinkedIn /
profile parsing, matching rural/aspirational priority, or mass
Naukri
heuristics. government rollouts.

Custom Govt Reports of low uptake and coordination


Centralized registration
Portals / PMIS problems; limited matching
& stipend tracking.
app sophistication. (DD News, Google Play)

Research & papers (short)

• Recommendation systems & internships — surveys and experimental designs show


hybrid models (content + collaborative) perform best for placements. (Emerald,
ScienceDirect)
• Algorithmic fairness in India: frameworks to operationalize fairness must account for
India’s complex vertical/horizontal reservation system. Use specialized algorithms to
enforce policy constraints. (tayfunsonmez.net, arXiv)

Competitor analysis (quick takeaways)

• Market platforms (Internshala) excel at discovery & UX but lack policy-aware


matching and bulk orchestration.
• Govt apps exist for registration/stipends but often lack advanced matching,
transparency, and industry quality control. (DD News, Internshala)

Innovative differentiators you can build

• Policy-aware matching engine: built-in constraint solver for quotas/reservations with


traceable audit logs.
• Transparent match scores: show contribution of each factor (skills, location,
affirmative-action weight).
• Explainable AI: provide human-readable reasons for match ranking to increase trust.
• Capacity-aware offers: support batch offers and cascading offers if first accepts
decline.
• Localized UX & offline support: regional languages, SMS/IVR for low-connectivity
users.
• Verification pipeline: lightweight document verification via college nodal approvals
or blockchain-anchored hashes (optional).
• Monitoring & fairness dashboards: live KPIs + fairness metrics (e.g., selection
rates across categories).

5. Clarity of Problem Statement 🧩


What exactly is being asked?

• Build an AI/algorithmic matching system that:


o Accepts student profiles and internship listings,
o Matches using multi-criteria (skills, location, sector, prior participation),
o Enforces affirmative action/quotas and capacity,
o Presents transparent dashboards and notifications,
o Uses widely available cloud infrastructure / opensource tools for replicability.

Where teams may misinterpret

• Thinking the goal is a job board (discovery) rather than policy-aware, batch,
scalable matching.
• Over-engineering verification (full KYC/blockchain) when a simple college
verification flag is enough for MVP.
• Missing the "inclusivity" constraint (affirmative action and aspirational district
preferences).

Framing the solution for evaluators

• Clearly separate matching logic (algorithm + constraints) from UX


(candidate/employer/admin).
• Demonstrate one full cycle end-to-end: sign-up → match generation → offer →
acceptance → allocation (with KPIs).
• Provide transparency & fairness artifacts (how quotas were applied, match score
decomposition).

6. Evaluator’s Perspective 🎯
How evaluators will judge

• Impact & scaleability: Can it handle target volumes? (show design & KPIs)
• Feasibility & demo: Working prototype with real flows & mock data.
• Inclusion & fairness: How does it enforce affirmative action and fairness metrics?
• Technical novelty: Hybrid matching + explainability + constraint solver.
• Policy alignment: Configurable quotas per scheme/state.

Top scoring criteria

• Clear MVP demonstrating match quality and quotas enforcement.


• Realistic deployment plan & cost estimates.
• Usability for low-tech users (regional languages, SMS).
• Measurable KPIs for fairness, fill rates, and time-to-place.

Red flags for evaluators

• Opaque matching (“black box”) with no explainability.


• No plan for verification or handling of bad/insufficient data.
• Ignoring affirmative action constraints or local policy requirements.
• Overambitious scope with no working demo.

7. Strategy for Team Fit & Execution 👥


Skills needed

• ML/NLP Engineer — models for ranking and parsing job/candidate text.


• Backend Engineer — API, matching pipeline, constraint solver.
• Frontend Engineer / Designer — dashboards & forms (React / Flutter).
• Product / Policy Lead — maps reservation/affirmative action rules into constraints.
• Data Engineer / QA — data cleaning, mock dataset generation, testing.
• DevOps — deployment (cloud functions, CI/CD), monitoring.

Ideal team ratio (5–6 people)

• ML/NLP: 1
• Backend: 1–2
• Frontend/UX: 1
• Product/Policy + Presenter: 1
• DevOps/Data: 1 (can be combined with backend)

Step-by-step pre-build research & ideation

1. Understand policy: codify reservation rules (vertical/horizontal, aspirational


districts).
2. Data modeling: define canonical schema for profiles & vacancies.
3. Build mock dataset: 500 candidate profiles + 200 vacancy postings (varied
geographies & quotas).
4. Define KPIs: fill rate, match precision@k, fairness metrics (selection rate by group).
5. Select models: embedding + ranking pipeline; light constraint solver (ILP / greedy +
backtracking).
6. Prototype UI flows: profile creation, vacancy posting, match results, admin
dashboard.
7. Test fairness: simulate scenarios and show how quotas are enforced.
8. Prepare demo script: show 2–3 end-to-end cases (urban candidate, aspirational
district candidate, industry batch offer).

📦 Technical Architecture (concise)


[Candidate UI] <---> [API Gateway] <---> [Matching Engine: Ranking +
Constraint Solver]
|---> [Profile DB (Postgres/Mongo)]
|---> [Vacancy DB]
|---> [Audit Log / Fairness DB]
|---> [Notifications (SMS/Email)]
|---> [Admin Dashboard]

⚡ Hackathon MVP Checklist (deliverables to


implement/demo)
• Candidate & Employer sign-up + structured profile forms (region, skills,
categories).
• Vacancy posting UI (capacity, quotas, location, required skills).
• Matching pipeline that returns ranked matches + per-match explanation.
• Admin dashboard: placements, unfilled slots, demographic breakdown.
• Notification flow: match offer → candidate acceptance → allocation.
• Small mock dataset (≥100 candidates, ≥50 vacancies) and 3 demo scenarios
(including affirmative-action case).

🔍 Competitor & Research References (selected)


• PM Internship Scheme overview & rollout goals. (DD News, National Government
Services Portal)
• Reports on low participation / implementation issues (pilot challenges). (The Times of
India)
• Internshala — practical marketplace & UX reference. (Internshala)
• Algorithmic fairness and India–specific reservation frameworks. (tayfunsonmez.net,
arXiv)
• Surveys & research on recommendation systems for internships/placements.
(Emerald, ScienceDirect)

⚖️ Ethical & Policy Notes (must-have)


• Explainability: store and surface reasons for matches (regulatory & trust).
• Consent & data minimization: collect only necessary PII; explicit consent for
stipend/KYC flows.
• Audit logs: immutable logs for allocation & quota enforcement.
• Bias testing: measure disparate impact and selection rates across groups.

🎯 Quick Demo Script (what to show judges)


1. Create candidate A (urban, CS skills), candidate B (aspirational district, similar
skills), and employer posting with capacity 2 and 1 slot reserved for aspirational
district.
2. Run matching → show how candidate B gets reserved slot due to quota (explainable
match score).
3. Admin dashboard: show fill-rate and fairness KPI.
4. Show automated notification to candidate and employer.
5. End with a short cost & scaling plan.

🔑 Key Takeaways
• Build a policy-aware, explainable matching platform that focuses on inclusion and
scalability.
• Use hybrid matching (NLP embeddings + ranking + constraints) and provide
transparent match scores for trust.
• MVP in a hackathon: matching pipeline + dashboards + notifications with mock data
— doable and impactful.
• Watch out for data verification, legal/privacy compliance, and the complexity of
India’s reservation policies — treat them as explicit constraints, not soft preferences.
(tayfunsonmez.net, arXiv)

You might also like