§galletasdigitales

  • Archivo
  • RSS
  • Pregunta lo que quieras
lickystickypickyme:

How Google manages data.
1. Collect MapReduce doesn’t depend on a traditional structured database, where  information is categorized as it’s collected. We’ll just gather up the  full text of every book Google has scanned.
2. Map You write a function to map the data: “Count every use of every word in  Google Books.” That request is then split among all the computers in  your army, and each agent is assigned a hunk of data to work with.  Computer A gets War and Peace, for example. That machine  knows what words that book contains, but not what’s inside Anna  Karenina.
3. Save Each of the hundreds of PCs doing a map writes the results to its local  hard drive, cutting down on data transfer time. The computers that have  been assigned “reduce” functions grab the lists from the mappers.
4. Reduce The Reduce computers  correlate the lists of words. Now you know how many times a particular  word is used, and in which books.
5. Solve The result? A data set about your data. In our example, the final list  of words is stored separately so it can be quickly referenced or  queried: “How often does Tolstoy mention Moscow? Paris?” You don’t have  to plow through unrelated data to get the answer.
 Wired
Pop-upView Separately

lickystickypickyme:

How Google manages data.

1. Collect
MapReduce doesn’t depend on a traditional structured database, where information is categorized as it’s collected. We’ll just gather up the full text of every book Google has scanned.

2. Map
You write a function to map the data: “Count every use of every word in Google Books.” That request is then split among all the computers in your army, and each agent is assigned a hunk of data to work with. Computer A gets War and Peace, for example. That machine knows what words that book contains, but not what’s inside Anna Karenina.

3. Save
Each of the hundreds of PCs doing a map writes the results to its local hard drive, cutting down on data transfer time. The computers that have been assigned “reduce” functions grab the lists from the mappers.

4. Reduce
The Reduce computers correlate the lists of words. Now you know how many times a particular word is used, and in which books.

5. Solve
The result? A data set about your data. In our example, the final list of words is stored separately so it can be quickly referenced or queried: “How often does Tolstoy mention Moscow? Paris?” You don’t have to plow through unrelated data to get the answer.


 Wired

Fuente: lickystickypickywe

    • #google
  • hace 1 año > lickystickypickywe
  • 73
  • Enlace permanente
  • Share
    Tweet

73 Notes/ Hide

  1. A johnmichel le gusta esto
  2. A ifonearth le gusta esto
  3. jazmokology ha reblogueado esto desde shaneguiter
  4. betacar ha reblogueado esto desde proofmathisbeautiful
  5. A icantreadpoetry le gusta esto
  6. A superrrsara le gusta esto
  7. A loichay le gusta esto
  8. A ianultra le gusta esto
  9. flyonair ha reblogueado esto desde dans-ce-pot
  10. dans-ce-pot ha reblogueado esto desde proofmathisbeautiful
  11. A marukido le gusta esto
  12. A aubreymcfato le gusta esto
  13. A cloois le gusta esto
  14. A feeqahaballah le gusta esto
  15. A atomic-oxygen le gusta esto
  16. A s0leful0ne le gusta esto
  17. mohammednasim ha reblogueado esto desde lickystickypickywe
  18. A herheartdances le gusta esto
  19. A sedso le gusta esto
  20. pixiesuicide ha reblogueado esto desde proofmathisbeautiful
  21. azfarmukmin ha reblogueado esto desde lickystickypickywe
  22. vovomark ha reblogueado esto desde proofmathisbeautiful
  23. A borderingoninsanity le gusta esto
  24. A macmankev le gusta esto
  25. A redcloud le gusta esto
  26. 5hane ha reblogueado esto desde shaneguiter y ha añadido:
    I was just reading about this the other day. Map/Reduce and BigTable are really cool. Reminds me of using a botnet to...
  27. A 5hane le gusta esto
  28. shaneguiter ha reblogueado esto desde danimunoz
  29. A webbo le gusta esto
  30. ublockedmeonfacebook ha reblogueado esto desde lickystickypickywe
  31. A rollsofrice le gusta esto
  32. A amandamaries le gusta esto
  33. cingulomania ha reblogueado esto desde proofmathisbeautiful
  34. A yo-yoyosh le gusta esto
  35. A alabagazoonto le gusta esto
  36. A teamlimabean le gusta esto
  37. A dlbsandwich le gusta esto
  38. firesaw ha reblogueado esto desde proofmathisbeautiful
  39. A wildorchidz le gusta esto
  40. A wordcard le gusta esto
  41. A aircount-brusherfeit le gusta esto
  42. A omg le gusta esto
  43. A kaching le gusta esto
  44. A messymorsels le gusta esto
  45. roomthily ha reblogueado esto desde proofmathisbeautiful
  46. A tomorrowsneverdie le gusta esto
  47. proofmathisbeautiful ha reblogueado esto desde lickystickypickywe
  48. leopardsblog ha reblogueado esto desde lickystickypickywe
  49. enoughthunder ha reblogueado esto desde lickystickypickywe
  50. A feedwell le gusta esto
  51. Mostrar más notasCargando...
← Anterior • Siguiente →

Logo

Acerca de

Curiosidades tecnológicas para el geek atareado

Páginas

  • Quién soy

En las redes sociales

  • @danielmp on Twitter
  • Facebook Profile
  • danielmp on Foursquare
  • Google
  • Linkedin Profile

Twitter

loading tweets…

  • RSS
  • Aleatorio
  • Archivo
  • Pregunta lo que quieras
  • Móvil

Effector Theme by Carlo Franco.

Proporcionado por Tumblr