Referências
33
J. Dean and S. Ghemawat, “Mapreduce: Simplified data processing on large clusters,” in OSDI, 2004
Dean, J. and Ghemawat, S. 2008. MapReduce: simplified data processing on large clusters.
Communication of ACM 51, 1 (Jan. 2008), 107-113.
G. Malewicz, M. H. Austern, A. J. C. Matthew, J. C. Dehnert, I. Horn, N. Leiser, and G. Czajkowski,
“Pregel: A system for large-scale graph processing,” in SIGMOD, 2010.
M. Zharia, M. Chowdhury, T. Das, A. Dave, J. Ma, M. McCauley, M. J. Franklin, S. Shenker, and I.
Stoica, “Resilient distributed datasets: A fault-tolerant abstraction for in memory cluster
computing”, 2011.
Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, C. Guestrin, and J. M. Hellerstein, “Distributed graphlab: A
framework for machine learning and data mining in the cloud,” PVLDB, vol. 5, no. 8, 2012.
A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, N. Zhang, S. Antony, H. Liu, and R. Murthy, “Hive a
petabyte scale data warehouse using hadoop” in ICDE, 2010.
A. F. Gates, O. Natkovich, S. Chopra, P. Kamath, S. M. Narayanamurthy, C. Olston, B. Reed, S.
Srinivasan, and U. Srivastava, “Building a high-level dataflow system on top of map-reduce: The pig
experience” PVLDB, 2009.
Apache Foundation, Hadoop, http://hadoop.apache.org/docs/current/