50% found this document useful (2 votes)
248 views2 pages

Assignment Name - Analytics Basics

The document provides 5 questions to answer regarding loading and analyzing an IMDB dataset using R and Python code. It asks the candidate to write code to load the data, extract unique genres and counts, convert data types, sort genres, and calculate the square of the difference between imdbrating and imdbvotes. The candidate must submit their answers and code as separate documents, and the project will be evaluated based on comprehensibility, code quality, and providing explanations.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
50% found this document useful (2 votes)
248 views2 pages

Assignment Name - Analytics Basics

The document provides 5 questions to answer regarding loading and analyzing an IMDB dataset using R and Python code. It asks the candidate to write code to load the data, extract unique genres and counts, convert data types, sort genres, and calculate the square of the difference between imdbrating and imdbvotes. The candidate must submit their answers and code as separate documents, and the project will be evaluated based on comprehensibility, code quality, and providing explanations.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Assignment Name - Analytics basics

Problem Statement -
Answer the following questions to the best of your knowledge including the concepts taught
to you in the level. You can find the IMDB_data in the folder containing this file.

1. Write R code using data “IMDB_data” to


a. Load CSV in R by skipping second row.
b. Extract the unique genres and its count and store in data frame with index key.

c. Convert the required data types


d. Sort the genre by its name
e. Create new variable whose values should be square of difference between
imdbrating and imdbvotes.

2. Write Python code using data “IMDB_data” to


a. Load CSV in R by skipping second row.
b. Extract the unique genres and its count and store in data frame with index key.

c. Convert the required data types


d. Sort the genre by its name
e. Create new variable whose values should be square of difference between
imdbrating and imdbvotes.

3. Define problem category for below problem statement


“A chemist wants to find some interesting patterns in which patients are behaving upon
administering the drug”

4. How will you select suitable machine learning algorithm for a problem statement

5. Define one problem statement for Education industry?


Evaluation Basis
This project will be evaluated on following basis -

1) Comprehensibility of the answer : You have to answer the given question with full
explanation, background information and examples(if required).
2) Quality of code : Write the best possible implementation from your side ensuring the
coding practices and conventions taught to you during training.

Deliverables from Candidate

1) A Doc or Pdf file containing your answers to theory questions


2) Code files - R and Python separate.
3) Notes to explain your code logic (if required)

Warning - Do not submit incomplete or wrong assignments. They will result in


negative skill score. Also, you will not a get a second attempt at the assignment!
So whatever score you get will be the final score

You might also like