Apache Flume: Distributed Log Collection for Hadoop

Name: Apache Flume: Distributed Log Collection for Hadoop
Author: Steve Hoffman

If your role includes moving datasets into Hadoop, this book will help you do it more efficiently using Apache Flume. From installation to customization, it's a complete step-by-step guide on making the service work for you.

Steve Hoffman

Medizin, Wissenschaft & Technik

Bisher keine Bewertungen

0.0

Entdecke diesen und 400.000 weitere Titel mit der Flatrate von Skoobe. Ab 12,99 € im Monat.

Teste 30 Tage kostenlos

Beschreibung zu „Apache Flume: Distributed Log Collection for Hadoop“

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. Its main goal is to deliver data from applications to Apache Hadoop's HDFS. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with many failover and recovery mechanisms.
Apache Flume: Distributed Log Collection for Hadoop covers problems with HDFS and streaming data/logs, and how Flume can resolve these problems. This book explains the generalized architecture of Flume, which includes moving data to/from databases, NO-SQL-ish data stores, as well as optimizing performance. This book includes real-world scenarios on Flume implementation.
Apache Flume: Distributed Log Collection for Hadoop starts with an architectural overview of Flume and then discusses each component in detail. It guides you through the complete installation process and compilation of Flume.
It will give you a heads-up on how to use channels and channel selectors. For each architectural component (Sources, Channels, Sinks, Channel Processors, Sink Groups, and so on) the various implementations will be covered in detail along with configuration options. You can use it to customize Flume to your specific needs. There are pointers given on writing custom implementations as well that would help you learn and implement them.
By the end, you should be able to construct a series of Flume agents to transport your streaming data and logs from your systems into Hadoop in near real time.

Verlag:

Packt Publishing

Veröffentlicht:

2013

Druckseiten:

ca. 88

Sprache:

English

Medientyp:

eBook

Ähnliche Titel wie „Apache Flume: Distributed Log Collection for Hadoop“

Learning Search-driven Application Development with SharePoint 2013

Johnny Tordgeman

HBase Administration Cookbook

Yifeng Jiang

Hadoop MapReduce Cookbook

Srinath Perera

Instant Apache Sqoop

Ankit Jain

Microsoft SQL Server 2012 with Hadoop

Debarchan Sarkar

Hadoop Real-World Solutions Cookbook

Jonathan R. Owens

jQuery Selectors

Aurelio De Rosa

Apache Solr 4 Cookbook

Rafal Kuc

HDInsight Essentials

Rajesh Nadipalli

Instant Apache Maven Starter

Maurizio Turatti

Lesen. Hören. Bücher erleben.

Jetzt kostenlos testen

Wir verwenden Cookies, um Inhalte zu personalisieren, Funktionen für soziale Medien anbieten zu können und die Zugriffe auf unsere Website zu analysieren. Details ansehen.