0% found this document useful (0 votes)
44 views1 page

Practical 3

The document outlines two assignments involving MapReduce programming. The first task requires calculating the size of each word and counting the occurrences of words of that size from a text file. The second task involves processing a dataset of patent records to determine the number of sub-patents associated with each patent.

Uploaded by

ANKIT MITTAL
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
44 views1 page

Practical 3

The document outlines two assignments involving MapReduce programming. The first task requires calculating the size of each word and counting the occurrences of words of that size from a text file. The second task involves processing a dataset of patent records to determine the number of sub-patents associated with each patent.

Uploaded by

ANKIT MITTAL
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT, PDF, TXT or read online on Scribd
You are on page 1/ 1

Q.

1Apply your MapReduce programming knowledge and write a MapReduce program to


calculate the size of each word and
count the number of words of that size in the text file.

Sample Output
WordSize Word Count
1 1
2 4
3 3
4 6
Your task in this assignment is to process the �alphabets� text file using
MapReduce program.
Your program should calculate the size of each word (Number of alphabets in the
word) and
count the number of words of that size in this text file.

Q.2 Apply your MapReduce programming knowledge and write a MapReduce program to
process a dataset with patent records.
You need to calculate the number of sub-patents associated with each patent.
The dataset for this problem is the �pa.txt� records file . Each patent has sub-
patent id�s associated with it.
You need to calculate the number of sub-patent associated with each patent.
Here is the sample output:

Patent Number of Associated Sub-patents


1 13
2 10
3 4

You might also like