Toward Scalable Systems for Big Data Analytics: A Technology Tutorial
Tóm tắt
Từ khóa
Tài liệu tham khảo
rao, 2012, Survey on Improved Scheduling in Hadoop MapReduce in Cloud Environments
yu, 2008, DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language, Proc 8th USENIX Conf Oper Syst Des Implement, 1
zaharia, 2012, Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing, Proc 9th USENIX Conf Netw Syst Des Implement, 2
peng, 2010, Large-scale incremental processing using distributed transactions and notifications, Proc 9th USENIX Conf Oper Syst Des Implement, 1
agrawal, 0, Challenges and opportunities with big data—A community white paper developed by leading researchers across the united states
hey, 2009, The Fourth Paradigm Data-Intensive Scientific Discovery
2013, Vertica
sandholm, 2010, Dynamic proportional share scheduling in Hadoop, Job Scheduling Strategies for Parallel Processing, 110, 10.1007/978-3-642-16505-4_7
goodhope, 2012, Building linkedins real-time activity data pipeline, Data Eng, 35, 33
yong, 2009, Towards a resource aware scheduler in Hadoop, Proc Int Conf Web Services (ICWS), 102
2013, Storm
chen, 2011, The case for evaluating mapreduce performance using workload suites, Proc IEEE 19th Int Symp Model Anal Simul Comput Telecommun Syst (MASCOTS), 390, 10.1109/MASCOTS.2011.12
2013, Gray Sort
2013, Pig Mix
2013, Grid Mix
murray, 2011, Ciel: A universal execution engine for distributed data-flow computing, Proc 8th USENIX Conf Netw Syst Des Implement, 9
nambiar, 2006, The making of TPC-DS, Proc 32nd Int Conf Very Large Data Bases (VLDB) Endowment, 1049
2013, TPC benchmarks
richardson, 2008, Magic Quadrant for Business Intelligence Platforms
blackett, 2013, Analytics Network-O R Analytics
eschenfelder, 1980, Data Mining and Knowledge Discovery Handbook, 14
friedman, 2008, Data visualization and infographics
2013, iPlant
foundation, 2013, Core Techniques and Technologies for Advancing Big Data Science and Engineering
economist, 2011
2013, Aster Data
2013, Netezza
2013, Greenplum[EB/OL]
team, 2011, Big Data Now Current Perspectives from O’Reilly Radar
marche, 2012, Is Facebook making us lonely, Atlantic, 309, 60
grobelnik, 2012, Big Data Tutorial
2013, Summingbird
2014, Teradata
chen, 2012, We don’t know enough to make a big data benchmark suite—An academia-industry view, Proc Workshop Big Data Benchmarking (WBDB)
gantz, 2010, The digital universe decade-are you ready, IDC White Paper
layton, 2013, How Amazon Works
2013, DEX
2013, Neo4j
baker, 2011, Megastore: Providing scalable, highly available storage for interactive services, Proc Conf Innov Database Res (CIDR), 223
2013, Hypert
2013, Mongodb
crochford, 2006, RFC 4627 - The Application/json Media Type for JavaScript Object Notation (JSON)
burrows, 2006, The chubby lock service for loosely-coupled distributed systems, Proc Symp Oper Syst Des Implementation, 335
lakshman, 2009, Cassandra: Structured storage system on a p2p network, Proc ACM Symp Principles Distributed Computing, 5, 10.1145/1582716.1582722
2013, HBase
laurila, 2012, The mobile data challenge: Big data for mobile computing research, Proc 10th Int Conf Pervas Comput Workshop Nokia Mobile Data Challenge Conjunct, 1
wang, 2012, Semantically-aware data discovery and placement in collaborative computing environments
2013, ATLAS
2013, SDSS
2013, A Comprehensive List of Big Data Statistics
gallagher, 2013, The Big Data Value Chain
walker, 1996, MPI: A standard message passing interface, Supercomputer, 12, 56
tanenbaum, 2006, Distributed Systems Principles and Paradigms
economist, 2011, Drowning in Numbers—Digital Data Will Flood the Planet- and Help us Understand it Better
cukier, 2010, Data, data everywhere, Economist, 394, 3
noguchi, 2011, Following Digital Breadcrumbs to Big Data Gold
lohr, 2012, New York Times, 11
house, 2012, Fact Sheet Big Data Across the Federal Government
2013, eBay Study How to Build Trust and Improve the Shopping Experience
noguchi, 2011, The Search for Analysts to Make Sense-of-Big-Data
corbett, 2012, Spanner: Google’s globally-distributed database, Proc 10th Conf Oper Syst Des Implement (OSDI)
kelly, 2013, Taming Big Data
evans, 2010, The explosion of data
2013, What is Big Data?
sevilla, 2012, Big Data Vendors and Technologies the list!
jain, 1999, Biometrics Personal Identification in Networked Society
choudhary, 2012, Crawling rich internet applications: The state of the art, Proc Conf Center Adv Studies Collaborative Res (CASCON), 146
2013, Robots
ceriotti, 2009, Monitoring heritage buildings with wireless sensor networks: The Torre Aquila deployment, Proc Int Conf Inf Process Sensor Netw, 277
2013, Journal of Scientific Instruments
wahab, 2008, Data pre-processing on web server logs for generalized association rules mining algorithm, World Acad Sci Eng Technol, 48, 970
friedman, 0, Data visualization Modern approaches
ye, 2010, DOS—A scalable optical switch for datacenters, Proc 6th ACM/IEEE Symp Archit Netw Commun Syst, 1
anderson, 2003, An Introduction to Multivariate Statistical Analysis
müller, 2005, Problems Methods and Challenges in Comprehensive Data Cleansing
goutelle, 2005, A survey of transport protocols other than standard TCP
jinno, 2009, Dynamic optical mesh networks: Drivers, challenges and solutions for the future, Proc Eur Conf Optical Communication (ECOC), 1
2009, Cisco Data Center Interconnect Design and Deployment Guide
hoelzle, 2009, The Datacenter as a Computer An Introduction to the Design of Warehouse-Scale Machines
han, 2006, Data Mining Concepts and Techniques
manning, 1999, Foundations of Statistical Natural Language Processing
ritter, 2011, Named entity recognition in tweets: An experimental study, Proc Conf Empirical Methods Nat Lang Process, 1524
maybury, 2004, New Directions in Question Answering
konopnicki, 1995, W3QS: A query system for the world-wide web, Proc Int Conf On Very Large Data Bases, 54
hu, 2011, A survey on visual content-based video indexing and retrieval, IEEE Trans Syst Man Cybern C Appl Rev, 41, 797, 10.1109/TSMCC.2011.2109710
li, 2008, Discriminant locally linear embedding with high-order tensor data, IEEE Trans Syst Man Cybern B Cybern, 38, 342, 10.1109/TSMCB.2007.911536
li, 2010, L1-norm-based 2DPCA, IEEE Trans Syst Man Cybern B Cybern, 40, 1170, 10.1109/TSMCB.2009.2035629
watts, 2004, Six Degrees The Science of a Connected Age
jiang, 2010, Columbia-UCF TRECvid2010 multimedia event detection: Combining multiple modalities, contextual concepts, and temporal matching, Proc Nat Inst Standards Technol (NIST) TRECvid Workshop, 2, 6
mell, 2009, The NIST definition of cloud computing, National Inst Standards Technol, 53, 50
troppens, 2011, Storage Networks Explained Basics and Application of Fibre Channel SAN NAS ISCSI Infiniband and FCoE
guerra, 2011, Cost effective storage using extent based dynamic tiering, Proc 9th USENIX Conf File Stroage Technol (FAST), 273
soundararajan, 2010, Extending SSD lifetimes with disk-based write caches, Proc 8th USENIX Conf File Storage Technol, 8
clark, 2005, Storage Virtualization Technologies for Simplifying Data Storage and Management
2013, Hadoop Distributed File System
beaver, 2010, Finding a needle in Haystack: Facebook’s photo storage, Proc 9th USENIX Conf Oper Syst Des Implement (OSDI), 1
2013, Taobao File System
2013, Kosmosfs
decandia, 2007, Dynamo: Amazon’s highly available key-value store, SIGOPS Oper Syst Rev, 41, 205, 10.1145/1323293.1294281
2013, Fast Distributed File System
2013, Voldemort
2013, Redis
2013, Tokyo Canbinet
2013, Tokyo Tyrant
2013, Memcached
2013, Memcached
2013, Riak
manyika, 2011, Big Data The Next Frontier for Innovation Competition and Productivity, 1
2013, Scala
gantz, 2012, The digital universe in 2020: Big data, bigger digital shadows, and biggest growth in the far east, IDC IView IDC Analyze the Future
dai, 2008, Translated learning: Transfer learning across different feature spaces, Proc Adv Neural Inform Process Syst (NIPS), 353
hu, 0, Towards multi-screen social tv with geo-aware social sense, IEEE Multimedia
maletic, 2000, Data cleansing: Beyond integrity analysis, Proc Conf Inform Qual, 200
silberschatz, 1997, Database System Con-cepts, 4
salomon, 2004, Data Compression
2013, Cisco visual networking index: Global mobile data traffic forecast update
2013, Applications and Organizations Using Hadoop
white, 2012, Hadoop The Definitive Guide
gantz, 2011, Extracting value from chaos, Proc IDC iView, 1
zikopoulos, 2011, Understanding Big Data Analytics for Enterprise Class Hadoop and Streaming Data
laney, 2001, 3d data management: Controlling data volume, velocity and variety
cooper, 2012, Tackling Big Data
symes, 2004, Digital Video Compression
condie, 2010, Mapreduce online, Proc 7th USENIX Conf Netw Syst Des Implement, 21