Monday, January 10, 2022

Bigtable lecture notes | Distributed systems course | Rutgers University | Paul Krzyzanowski | Fall 2018

Introduction

I learn better when I type out material word by word, so I am typing up these lecture notes. I think studying lecture notes about Bigtable is one of the best investments of my time.

Lecture notes

Here are the lecture slides.

Bigtable

  • Highly available distributed storage
  • Built with semi-structured data in mind
    • URLs: content, metadata, links, anchors, page rank
    • User data: preferences, account info, recent queries
    • Geography: roads, satellite images, points of interest, annotations
  • Large scale    
    • Petabytes of data across thousands of servers
    • Billions of URLs with many versions per page
    • Hundreds of millions of users
    • Thousands of queries per second
    • 100TB+ satellite image data
Uses
At Google, used for:
  • Google Analytics
  • Google Finance
  • Personalized search
  • Blogger.com
  • Google Code hosting
  • YouTube
  • Gmail
  • Google Earth & Google Maps
  • Dozens of others... over sixty products
A big table
    Bigtable is NOT a relational database
    Bigtable appears as a large table
    "A Bigtable is a sparse, distributed, persistent multidimensional sorted map"
    
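To make the quoted definition concrete, here is a minimal single-process sketch of a sparse, multidimensional sorted map. It assumes the indexing described in the Bigtable paper (row key, column key, timestamp) and leaves out the "distributed" and "persistent" parts entirely. The class name ToyTable, its methods, and the sample row and column keys are hypothetical names chosen for illustration, not Bigtable's API.

```python
from bisect import bisect_left, insort


class ToyTable:
    """Toy in-memory model: (row, column, timestamp) -> value.

    Hypothetical illustration only; not Bigtable's real API.
    """

    def __init__(self):
        self._row_keys = []  # row keys kept in sorted order
        self._rows = {}      # row -> {column -> {timestamp -> value}}

    def put(self, row, column, timestamp, value):
        # Sparse: a row stores only the columns that were actually written.
        if row not in self._rows:
            insort(self._row_keys, row)
            self._rows[row] = {}
        self._rows[row].setdefault(column, {})[timestamp] = value

    def get(self, row, column, timestamp=None):
        # Return the requested version, or the newest one if none is given.
        versions = self._rows.get(row, {}).get(column)
        if not versions:
            return None
        if timestamp is None:
            timestamp = max(versions)
        return versions.get(timestamp)

    def scan(self, start_row, end_row):
        # Sorted: rows in [start_row, end_row) come back in key order.
        lo = bisect_left(self._row_keys, start_row)
        hi = bisect_left(self._row_keys, end_row)
        for row in self._row_keys[lo:hi]:
            yield row, self._rows[row]


# Sample data (assumed for illustration): reversed-URL row keys.
table = ToyTable()
table.put("com.cnn.www", "contents:", 1, "<html>v1</html>")
table.put("com.cnn.www", "contents:", 2, "<html>v2</html>")
table.put("com.cnn.www", "anchor:cnnsi.com", 1, "CNN")
table.put("org.example.www", "contents:", 1, "<html>example</html>")

latest = table.get("com.cnn.www", "contents:")
older = table.get("com.cnn.www", "contents:", timestamp=1)
missing = table.get("org.example.www", "anchor:cnnsi.com")
scanned = [row for row, _ in table.scan("com", "net")]
```

Keeping the row keys sorted is what makes the range scan cheap, and storing only the written cells is what "sparse" means here: the second row has no anchor column at all, so it takes no space for one.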
To be continued!

