• Data Science and Big Data Analytics: Discovering, Analyzing, Visualizing and Presenting Data

    Data Science and Big Data Analytics is about harnessing the power of data for new insights. The book covers the breadth of activities, methods, and tools that Data Scientists use. The content focuses on concepts, principles and practical applications that are applicable to any industry and technology environment, and the learning is supported and explained with examples that you can replicate using open-source software. This book will help you: become a contributor on a data science team; deploy a structured lifecycle approach to data analytics problems; apply appropriate analytic techniques ...

    • ASIN: 111887613X
    • ISBN: 111887613X
    • Brand: imusti
    • Manufacturer: Wiley

  • Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems

    Data is at the center of many challenges in system design today. Difficult issues need to be figured out, such as scalability, consistency, reliability, efficiency, and maintainability. In addition, we have an overwhelming variety of tools, including relational databases, NoSQL datastores, stream or batch processors, and message brokers. What are the right choices for your application? How do you make sense of all these buzzwords? In this practical and comprehensive guide, author Martin Kleppmann helps you navigate this diverse landscape by examining the pros and cons of various technologies fo...

    • ASIN: 1449373321
    • ISBN: 1449373321
    • Manufacturer: O'Reilly Media

  • Big Data: Principles and best practices of scalable realtime data systems

    Summary: Big Data teaches you to build big data systems using an architecture that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web-scale data. It describes a scalable, easy-to-understand approach to big data systems that can be built and run by a small team. Following a realistic example, this book guides readers through the theory of big data systems, how to implement them in practice, and how to deploy and operate them once they're built. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Pub...

    • ASIN: 1617290343
    • ISBN: 1617290343
    • Brand: Manning Publications
    • Manufacturer: Manning Publications

  • Hadoop BIG DATA Interview Questions You'll Most Likely Be Asked (Job Interview Questions Series)

    • 200 Hadoop BIG DATA Interview Questions • 76 HR Interview Questions • Real-life scenario-based questions • Strategies to respond to interview questions • 2 Aptitude Tests. Hadoop BIG DATA Interview Questions You'll Most Likely Be Asked is a perfect companion to help you stand out from the rest in today’s competitive job market. Rather than going through comprehensive, textbook-sized reference guides, this book includes only the information required immediately for a job search to build an IT career. This book puts the interviewee in the driver's seat and helps them steer their way to impress...

    • ASIN: 1946383481
    • ISBN: 1946383481
    • Manufacturer: Vibrant Publishers

  • High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

    Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster and handle larger data sizes, while using fewer resources. Ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications, this book describes techniques that can reduce data infrastructure costs and develo...

    • ASIN: 1491943203
    • ISBN: 9781491943205
    • Manufacturer: O'Reilly Media

  • R for Data Science: Import, Tidy, Transform, Visualize, and Model Data

    Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible. Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You’ll get a complete, big-picture understanding of t...

    • ASIN: 1491910399
    • ISBN: 1491910399
    • Brand: O'Reilly Media
    • Manufacturer: O'Reilly Media

  • Descriptive Data Mining (Computational Risk Management)

    This book offers an overview of knowledge management. It starts with an introduction to the subject, placing descriptive models in the context of the overall field as well as within the more specific field of data mining analysis. Chapter 2 covers data visualization, including directions for accessing R open source software (described through Rattle). Both R and Rattle are free to students. Chapter 3 then describes market basket analysis, comparing it with more advanced models, and addresses the concept of lift. Subsequently, Chapter 4 describes marketing RFM models and compares them with more ...

    • ASIN: 9811033390
    • ISBN: 9811033390
    • Manufacturer: Springer

  • Advanced Analytics with Spark: Patterns for Learning from Data at Scale

    In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. You’ll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques (classification, collaborative filtering, and anomaly detection, among others) to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning ...

    • ASIN: 1491912766
    • ISBN: 1491912766
    • Brand: O'Reilly Media
    • Manufacturer: O'Reilly Media

  • Learn Hadoop in 1 Day: Master Big Data with this complete Guide

    Hadoop has changed the way large data sets are analyzed, stored, transferred, and processed. At low cost, it provides benefits such as tolerance of partial failure, fault tolerance, consistency, scalability, and a flexible schema. It also supports cloud computing. More and more people are looking to master Hadoop. When starting out with Hadoop, most users are unsure how to proceed. They are not aware of the prerequisites or data structures they should be familiar with, or how to make the most efficient use of Hadoop and ...

    • ASIN: B01MYRU6W9

  • Programming Python: Powerful Object-Oriented Programming

    If you've mastered Python's fundamentals, you're ready to start using it to get real work done. Programming Python will show you how, with in-depth tutorials on the language's primary application domains: system administration, GUIs, and the Web. You'll also explore how Python is used in databases, networking, front-end scripting layers, text processing, and more. This book focuses on commonly used tools and libraries to give you a comprehensive understanding of Python’s many roles in practical, real-world programming. You'll learn language syntax and programming techniques in a clear and con...

    • ASIN: 0596158106
    • ISBN: 0596158106
    • Brand: imusti
    • Manufacturer: O'Reilly Media

  • Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools

    Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications, each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project. While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform. What You Will Learn: Set up the environment in Linux ...

    • ASIN: 1484221982
    • ISBN: 1484221982
    • Manufacturer: Apress

  • Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale

    Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters. Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. You’ll learn about recent changes to Hadoop, and explore new case studies on Hadoop’s role in healthca...

    • ASIN: 1491901632
    • ISBN: 1491901632
    • Brand: O'Reilly Media
    • Manufacturer: O'Reilly Media

  • Open Source Data Warehousing and Business Intelligence

    Open Source Data Warehousing and Business Intelligence is an all-in-one reference for developing open source based data warehousing (DW) and business intelligence (BI) solutions that are business-centric, cross-customer viable, cross-functional, cross-technology based, and enterprise-wide. Considering the entire lifecycle of an open source DW &

    • ASIN: B00OD4L8HI
    • Manufacturer: CRC Press

  • The Complete Privacy & Security Desk Reference: Volume I: Digital (Volume 1)

    This textbook, at nearly 500 pages, will explain how to become digitally invisible. You will make all of your communications private, data encrypted, internet connections anonymous, computers hardened, identity guarded, purchases secret, accounts secured, devices locked, and home address hidden. You will remove all personal information from public view and will reclaim your right to privacy. You will no longer give away your intimate details and you will take yourself out of 'the system'. You will use covert aliases and misinformation to eliminate current and future threats toward your privacy...

    • ASIN: 152277890X
    • ISBN: 152277890X
    • Manufacturer: CreateSpace Independent Publishing Platform

  • Practical Apache Spark: Using the Scala API

    Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLlib, and R on Spark with the help of practical code snippets for each topic. Practical Apache Spark also covers the integration of Apache Spark with Kafka with examples. You’ll follow a learn-to-do-by-yourself approach to learning: learn the concepts, practice the code snippets in Scala, and complete the assignments given to get an overall exposure. On comp...

    • ASIN: 1484236513
    • ISBN: 1484236513
    • Manufacturer: Apress

  • Complete Guide to Open Source Big Data Stack

    See a Mesos-based big data stack created and the components used. You will use currently available Apache full and incubating systems. The components are introduced by example and you learn how they work together. In the

    • UPC: 345092402

  • The Definitive Guide to Mongodb : A Complete Guide to Dealing with Big Data Using Mongodb

    THE DEFINITIVE GUIDE TO MONGODB

    • UPC: 27385111

  • How to Choose the Right Database? - MongoDB, Cassandra, MySQL, HBase - Frank Kane

    Big Data Tools and Technologies | Big Data Tools Tutorial | Big Data Training | Simplilearn

    Greenplum Database: The First Open Source Data Warehouse