Wednesday, June 23, 2021

Map Reduce Paper - Distributed data processing | My 10 minutes study


Here is the link. 

This is the paper that inspired Hadoop. This video explains the Map Reduce concepts that are used for distributed big data processing. It takes some liberties to explain the underlying concept as simply as possible. For example, the map step for the song count is typically implemented by emitting the number 1 for each song title, after which a combiner function locally aggregates (sums) these counts per song. The video also leaves out many interesting implementation details; I encourage you to read the paper for them.
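To make the song count example concrete, here is a minimal single-process sketch in Python. It only imitates the flow (map emits 1 per title, a combiner sums locally, reduce merges the partial sums) and is not how Hadoop or the paper's system is implemented. The function names (map_songs, combine, reduce_counts, count_songs) and the idea of passing each worker's input as a plain list are my own assumptions for illustration.

```python
from collections import Counter, defaultdict


def map_songs(plays):
    """Map: emit a (title, 1) pair for every song play in one input split."""
    for title in plays:
        yield (title, 1)


def combine(pairs):
    """Combiner: sum the counts per title locally, before anything is shuffled."""
    local = Counter()
    for title, count in pairs:
        local[title] += count
    return list(local.items())


def reduce_counts(partials):
    """Reduce: merge the partial sums coming from all splits into final totals."""
    totals = defaultdict(int)
    for pairs in partials:
        for title, count in pairs:
            totals[title] += count
    return dict(totals)


def count_songs(splits):
    """Run map + combine on each split (one split per hypothetical worker), then reduce."""
    partials = [combine(map_songs(split)) for split in splits]
    return reduce_counts(partials)


if __name__ == "__main__":
    # Two hypothetical input splits, as if handled by two separate workers.
    splits = [["Hey Jude", "Imagine", "Hey Jude"], ["Imagine", "Yesterday"]]
    print(count_songs(splits))
```

The combiner is what keeps network traffic down: each worker sends one (title, partial sum) pair per song instead of one pair per play. It works here because addition is associative and commutative, so summing locally first does not change the final totals.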

