Did you know our first e-book was published in May 2012? jQuery Succinctly was the start of over one hundred free e-books written to help developers get free access to software development resources and strategies. Capturing and understanding big data is by far one of the most-asked-about topics across both our support and learning sites.
Ranked in order of popularity, here are Succinctly e-books related to big data to add to your reading list:
An Apache open source project, Hadoop stores huge amounts of data in safe, reliable storage and runs complex queries over data in an efficient way. It is at the core of a whole host of the most popular big data tools. Mastering Hadoop ensures you get the best out of all these tools and better insight from your data. Elton Stoneman’s Hadoop Succinctly explains how Hadoop works, what goes on in the cluster, demonstrating how to move data in and out of Hadoop, and how to query it efficiently. It also walks through a Java MapReduce example, illustrates how to write the same query in Python and .NET, and discusses the wider Hadoop ecosystem.
Apache Cassandra is an open-source distributed database system noted for its consistency, its ability to run on commodity hardware, and its easy deployment to cloud infrastructure. With Cassandra Succinctly by Marko Švaljek, you will learn the architecture behind Cassandra that makes it a reliable system, including how hashes are used to identify where data is stored, how choosing the right replication factor can enhance or hinder performance, and how the system handles updating and deleting data. By the end of the book, you will be able to store and model data using the Cassandra Query Language, and use Cassandra within your own applications.
Mastering big data requires an aptitude at every step of information processing. Post-processing, one of the most important steps, is where you find Apache Spark frequently employed. Spark Succinctly, by Marko Švaljek, addresses Spark’s use in the ultimate step in handling big data. This e-book, the third installment in Švaljek’s IoT series, teaches the basics of using Spark and explores how to work with RDDs, Scala and Python tasks, JSON files, and Cassandra.
Neural networks are an exciting field of software development used to calculate outputs from input data. While the idea seems simple enough, the implications of such networks are staggering—think optical character recognition, speech recognition, and regression analysis. With Neural Networks Using C# Succinctly by James McCaffrey, you’ll learn how to create your own neural network to solve classification problems, or problems where the outcomes can only be one of several values. Learn about encoding and normalizing data, activation functions and how to choose the right one, and ultimately how to train a neural network to find weights and bias values that provide accurate predictions.
Capturing and extracting information is one of the most important tasks a developer can perform, and making this task more engaging without relying entirely on specialized tools is an efficient way to improve productivity. In Data Capture and Extraction with C# Succinctly, author Ed Freitas guides readers toward getting more out of C# in minimal time.
Let us know which one is your favorite in the comments!