From Curve Fitting to Machine Learning

An Illustrative Guide to Scientific Data Analysis and Computational Intelligence

Author: Achim Zielesny

Publisher: Springer Science & Business Media

ISBN: 3642212808

Category: Computers

Page: 465

View: 2097

DOWNLOAD NOW »
The analysis of experimental data is at heart of science from its beginnings. But it was the advent of digital computers that allowed the execution of highly non-linear and increasingly complex data analysis procedures - methods that were completely unfeasible before. Non-linear curve fitting, clustering and machine learning belong to these modern techniques which are a further step towards computational intelligence. The goal of this book is to provide an interactive and illustrative guide to these topics. It concentrates on the road from two dimensional curve fitting to multidimensional clustering and machine learning with neural networks or support vector machines. Along the way topics like mathematical optimization or evolutionary algorithms are touched. All concepts and ideas are outlined in a clear cut manner with graphically depicted plausibility arguments and a little elementary mathematics. The major topics are extensively outlined with exploratory examples and applications. The primary goal is to be as illustrative as possible without hiding problems and pitfalls but to address them. The character of an illustrative cookbook is complemented with specific sections that address more fundamental questions like the relation between machine learning and human intelligence. These sections may be skipped without affecting the main road but they will open up possibly interesting insights beyond the mere data massage. All topics are completely demonstrated with the aid of the commercial computing platform Mathematica and the Computational Intelligence Packages (CIP), a high-level function library developed with Mathematica's programming language on top of Mathematica's algorithms. CIP is open-source so the detailed code of every method is freely accessible. All examples and applications shown throughout the book may be used and customized by the reader without any restrictions. The target readerships are students of (computer) science and engineering as well as scientific practitioners in industry and academia who deserve an illustrative introduction to these topics. Readers with programming skills may easily port and customize the provided code.

From Curve Fitting to Machine Learning

An Illustrative Guide to Scientific Data Analysis and Computational Intelligence

Author: Achim Zielesny

Publisher: Springer

ISBN: 3319325450

Category: Computers

Page: 498

View: 4591

DOWNLOAD NOW »
This successful book provides in its second edition an interactive and illustrative guide from two-dimensional curve fitting to multidimensional clustering and machine learning with neural networks or support vector machines. Along the way topics like mathematical optimization or evolutionary algorithms are touched. All concepts and ideas are outlined in a clear cut manner with graphically depicted plausibility arguments and a little elementary mathematics.The major topics are extensively outlined with exploratory examples and applications. The primary goal is to be as illustrative as possible without hiding problems and pitfalls but to address them. The character of an illustrative cookbook is complemented with specific sections that address more fundamental questions like the relation between machine learning and human intelligence.All topics are completely demonstrated with the computing platform Mathematica and the Computational Intelligence Packages (CIP), a high-level function library developed with Mathematica's programming language on top of Mathematica's algorithms. CIP is open-source and the detailed code used throughout the book is freely accessible.The target readerships are students of (computer) science and engineering as well as scientific practitioners in industry and academia who deserve an illustrative introduction. Readers with programming skills may easily port or customize the provided code. "'From curve fitting to machine learning' is ... a useful book. ... It contains the basic formulas of curve fitting and related subjects and throws in, what is missing in so many books, the code to reproduce the results.All in all this is an interesting and useful book both for novice as well as expert readers. For the novice it is a good introductory book and the expert will appreciate the many examples and working code". Leslie A. Piegl (Review of the first edition, 2012).

Understanding Machine Learning

From Theory to Algorithms

Author: Shai Shalev-Shwartz,Shai Ben-David

Publisher: Cambridge University Press

ISBN: 1107057132

Category: Computers

Page: 409

View: 7396

DOWNLOAD NOW »
Introduces machine learning and its algorithmic paradigms, explaining the principles behind automated learning approaches and the considerations underlying their usage.

Intelligent Systems for Engineers and Scientists, Third Edition

Author: Adrian A. Hopgood

Publisher: CRC Press

ISBN: 1498783791

Category: Computers

Page: 451

View: 8176

DOWNLOAD NOW »
The third edition of this bestseller examines the principles of artificial intelligence and their application to engineering and science, as well as techniques for developing intelligent systems to solve practical problems. Covering the full spectrum of intelligent systems techniques, it incorporates knowledge-based systems, computational intelligence, and their hybrids. Using clear and concise language, Intelligent Systems for Engineers and Scientists, Third Edition features updates and improvements throughout all chapters. It includes expanded and separated chapters on genetic algorithms and single-candidate optimization techniques, while the chapter on neural networks now covers spiking networks and a range of recurrent networks. The book also provides extended coverage of fuzzy logic, including type-2 and fuzzy control systems. Example programs using rules and uncertainty are presented in an industry-standard format, so that you can run them yourself. The first part of the book describes key techniques of artificial intelligence—including rule-based systems, Bayesian updating, certainty theory, fuzzy logic (types 1 and 2), frames, objects, agents, symbolic learning, case-based reasoning, genetic algorithms, optimization algorithms, neural networks, hybrids, and the Lisp and Prolog languages. The second part describes a wide range of practical applications in interpretation and diagnosis, design and selection, planning, and control. The author provides sufficient detail to help you develop your own intelligent systems for real applications. Whether you are building intelligent systems or you simply want to know more about them, this book provides you with detailed and up-to-date guidance. Check out the significantly expanded set of free web-based resources that support the book at: http://www.adrianhopgood.com/aitoolkit/

The R Book

Author: Michael J. Crawley

Publisher: John Wiley & Sons

ISBN: 1118448960

Category: Mathematics

Page: 1080

View: 4697

DOWNLOAD NOW »
Hugely successful and popular text presenting an extensive and comprehensive guide for all R users The R language is recognized as one of the most powerful and flexible statistical software packages, enabling users to apply many statistical techniques that would be impossible without such software to help implement such large data sets. R has become an essential tool for understanding and carrying out research. This edition: Features full colour text and extensive graphics throughout. Introduces a clear structure with numbered section headings to help readers locate information more efficiently. Looks at the evolution of R over the past five years. Features a new chapter on Bayesian Analysis and Meta-Analysis. Presents a fully revised and updated bibliography and reference section. Is supported by an accompanying website allowing examples from the text to be run by the user. Praise for the first edition: ‘…if you are an R user or wannabe R user, this text is the one that should be on your shelf. The breadth of topics covered is unsurpassed when it comes to texts on data analysis in R.’ (The American Statistician, August 2008) ‘The High-level software language of R is setting standards in quantitative analysis. And now anybody can get to grips with it thanks to The R Book…’ (Professional Pensions, July 2007)

Artificial Intelligence for Marketing

Practical Applications

Author: Jim Sterne

Publisher: John Wiley & Sons

ISBN: 1119406331

Category: Business & Economics

Page: 368

View: 2392

DOWNLOAD NOW »
A straightforward, non-technical guide to the next major marketing tool Artificial Intelligence for Marketing presents a tightly-focused introduction to machine learning, written specifically for marketing professionals. This book will not teach you to be a data scientist—but it does explain how Artificial Intelligence and Machine Learning will revolutionize your company's marketing strategy, and teach you how to use it most effectively. Data and analytics have become table stakes in modern marketing, but the field is ever-evolving with data scientists continually developing new algorithms—where does that leave you? How can marketers use the latest data science developments to their advantage? This book walks you through the "need-to-know" aspects of Artificial Intelligence, including natural language processing, speech recognition, and the power of Machine Learning to show you how to make the most of this technology in a practical, tactical way. Simple illustrations clarify complex concepts, and case studies show how real-world companies are taking the next leap forward. Straightforward, pragmatic, and with no math required, this book will help you: Speak intelligently about Artificial Intelligence and its advantages in marketing Understand how marketers without a Data Science degree can make use of machine learning technology Collaborate with data scientists as a subject matter expert to help develop focused-use applications Help your company gain a competitive advantage by leveraging leading-edge technology in marketing Marketing and data science are two fast-moving, turbulent spheres that often intersect; that intersection is where marketing professionals pick up the tools and methods to move their company forward. Artificial Intelligence and Machine Learning provide a data-driven basis for more robust and intensely-targeted marketing strategies—and companies that effectively utilize these latest tools will reap the benefit in the marketplace. Artificial Intelligence for Marketing provides a nontechnical crash course to help you stay ahead of the curve.

Intelligent Production Machines and Systems - 2nd I*PROMS Virtual International Conference 3-14 July 2006

Author: Duc T. Pham,Eldaw E. Eldukhri,Anthony J. Soroka

Publisher: Elsevier

ISBN: 0080556345

Category: Technology & Engineering

Page: 688

View: 4959

DOWNLOAD NOW »
I*PROMS 2005 is an online web-based conference. It provides a platform for presenting, discussing, and disseminating research results contributed by scientists and industrial practitioners active in the area of intelligent systems and soft computing techniques (such as fuzzy logic, neural networks, evolutionary algorithms, and knowledge-based systems) and their application in different areas of manufacturing. Comprised of 100 peer-reviewed articles, this important resource provides tools to help enterprises achieve goals critical to the future of manufacturing. I*PROMS is an European Union-funded network that involves 30 partner organizations and more than 130 researchers from universities, research organizations, and corporations. * State-of-the-art research results * Leading European researchers and industrial practitioners * Comprehensive collection of indexed and peer-reviewed articles in book format supported by a user-friendly full-text CD-ROM with search functionality

Deep Learning with Python

Author: Francois Chollet

Publisher: Manning Publications

ISBN: 9781617294433

Category: Machine learning

Page: 384

View: 9994

DOWNLOAD NOW »
Summary Deep Learning with Python introduces the field of deep learning using the Python language and the powerful Keras library. Written by Keras creator and Google AI researcher Fran�ois Chollet, this book builds your understanding through intuitive explanations and practical examples. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the Technology Machine learning has made remarkable progress in recent years. We went from near-unusable speech and image recognition, to near-human accuracy. We went from machines that couldn't beat a serious Go player, to defeating a world champion. Behind this progress is deep learning--a combination of engineering advances, best practices, and theory that enables a wealth of previously impossible smart applications. About the Book Deep Learning with Python introduces the field of deep learning using the Python language and the powerful Keras library. Written by Keras creator and Google AI researcher Fran�ois Chollet, this book builds your understanding through intuitive explanations and practical examples. You'll explore challenging concepts and practice with applications in computer vision, natural-language processing, and generative models. By the time you finish, you'll have the knowledge and hands-on skills to apply deep learning in your own projects. What's Inside Deep learning from first principles Setting up your own deep-learning environment Image-classification models Deep learning for text and sequences Neural style transfer, text generation, and image generation About the Reader Readers need intermediate Python skills. No previous experience with Keras, TensorFlow, or machine learning is required. About the Author Fran�ois Chollet works on deep learning at Google in Mountain View, CA. He is the creator of the Keras deep-learning library, as well as a contributor to the TensorFlow machine-learning framework. He also does deep-learning research, with a focus on computer vision and the application of machine learning to formal reasoning. His papers have been published at major conferences in the field, including the Conference on Computer Vision and Pattern Recognition (CVPR), the Conference and Workshop on Neural Information Processing Systems (NIPS), the International Conference on Learning Representations (ICLR), and others. Table of Contents PART 1 - FUNDAMENTALS OF DEEP LEARNING What is deep learning? Before we begin: the mathematical building blocks of neural networks Getting started with neural networks Fundamentals of machine learning PART 2 - DEEP LEARNING IN PRACTICE Deep learning for computer vision Deep learning for text and sequences Advanced deep-learning best practices Generative deep learning Conclusions appendix A - Installing Keras and its dependencies on Ubuntu appendix B - Running Jupyter notebooks on an EC2 GPU instance

Pattern Recognition and Machine Learning

Author: Christopher M. Bishop

Publisher: Springer

ISBN: 9781493938438

Category: Computers

Page: 738

View: 1039

DOWNLOAD NOW »
This is the first textbook on pattern recognition to present the Bayesian viewpoint. The book presents approximate inference algorithms that permit fast approximate answers in situations where exact answers are not feasible. It uses graphical models to describe probability distributions when no other books apply graphical models to machine learning. No previous knowledge of pattern recognition or machine learning concepts is assumed. Familiarity with multivariate calculus and basic linear algebra is required, and some experience in the use of probabilities would be helpful though not essential as the book includes a self-contained introduction to basic probability theory.

Computer Vision

Algorithms and Applications

Author: Richard Szeliski

Publisher: Springer

ISBN: 9781848829466

Category: Computers

Page: 812

View: 8678

DOWNLOAD NOW »
Humans perceive the three-dimensional structure of the world with apparent ease. However, despite all of the recent advances in computer vision research, the dream of having a computer interpret an image at the same level as a two-year old remains elusive. Why is computer vision such a challenging problem and what is the current state of the art? Computer Vision: Algorithms and Applications explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply to their own personal photos and videos. More than just a source of “recipes,” this exceptionally authoritative and comprehensive textbook/reference also takes a scientific approach to basic vision problems, formulating physical models of the imaging process before inverting them to produce descriptions of a scene. These problems are also analyzed using statistical models and solved using rigorous engineering techniques Topics and features: structured to support active curricula and project-oriented courses, with tips in the Introduction for using the book in a variety of customized courses; presents exercises at the end of each chapter with a heavy emphasis on testing algorithms and containing numerous suggestions for small mid-term projects; provides additional material and more detailed mathematical topics in the Appendices, which cover linear algebra, numerical techniques, and Bayesian estimation theory; suggests additional reading at the end of each chapter, including the latest research in each sub-field, in addition to a full Bibliography at the end of the book; supplies supplementary course material for students at the associated website, http://szeliski.org/Book/. Suitable for an upper-level undergraduate or graduate-level course in computer science or engineering, this textbook focuses on basic techniques that work under real-world conditions and encourages students to push their creative boundaries. Its design and exposition also make it eminently suitable as a unique reference to the fundamental techniques and current research literature in computer vision.

Deep Learning for Medical Image Analysis

Author: S. Kevin Zhou,Hayit Greenspan,Dinggang Shen

Publisher: Academic Press

ISBN: 0128104090

Category: Technology & Engineering

Page: 458

View: 542

DOWNLOAD NOW »
Deep learning is providing exciting solutions for medical image analysis problems and is seen as a key method for future applications. This book gives a clear understanding of the principles and methods of neural network and deep learning concepts, showing how the algorithms that integrate deep learning as a core component have been applied to medical image detection, segmentation and registration, and computer-aided analysis, using a wide variety of application areas. Deep Learning for Medical Image Analysis is a great learning resource for academic and industry researchers in medical imaging analysis, and for graduate students taking courses on machine learning and deep learning for computer vision and medical image computing and analysis. Covers common research problems in medical image analysis and their challenges Describes deep learning methods and the theories behind approaches for medical image analysis Teaches how algorithms are applied to a broad range of application areas, including Chest X-ray, breast CAD, lung and chest, microscopy and pathology, etc. Includes a Foreword written by Nicholas Ayache

Data Mining

Practical Machine Learning Tools and Techniques

Author: Ian H. Witten,Eibe Frank,Mark A. Hall,Christopher J. Pal

Publisher: Morgan Kaufmann

ISBN: 0128043571

Category: Computers

Page: 654

View: 1625

DOWNLOAD NOW »
Data Mining: Practical Machine Learning Tools and Techniques, Fourth Edition, offers a thorough grounding in machine learning concepts, along with practical advice on applying these tools and techniques in real-world data mining situations. This highly anticipated fourth edition of the most acclaimed work on data mining and machine learning teaches readers everything they need to know to get going, from preparing inputs, interpreting outputs, evaluating results, to the algorithmic methods at the heart of successful data mining approaches. Extensive updates reflect the technical changes and modernizations that have taken place in the field since the last edition, including substantial new chapters on probabilistic methods and on deep learning. Accompanying the book is a new version of the popular WEKA machine learning software from the University of Waikato. Authors Witten, Frank, Hall, and Pal include today's techniques coupled with the methods at the leading edge of contemporary research. Please visit the book companion website at http://www.cs.waikato.ac.nz/ml/weka/book.html It contains Powerpoint slides for Chapters 1-12. This is a very comprehensive teaching resource, with many PPT slides covering each chapter of the book Online Appendix on the Weka workbench; again a very comprehensive learning aid for the open source software that goes with the book Table of contents, highlighting the many new sections in the 4th edition, along with reviews of the 1st edition, errata, etc. Provides a thorough grounding in machine learning concepts, as well as practical advice on applying the tools and techniques to data mining projects Presents concrete tips and techniques for performance improvement that work by transforming the input or output in machine learning methods Includes a downloadable WEKA software toolkit, a comprehensive collection of machine learning algorithms for data mining tasks-in an easy-to-use interactive interface Includes open-access online courses that introduce practical applications of the material in the book

Intelligent Natural Language Processing: Trends and Applications

Author: Khaled Shaalan,Aboul Ella Hassanien,Fahmy Tolba

Publisher: Springer

ISBN: 3319670565

Category: Computers

Page: 776

View: 372

DOWNLOAD NOW »
This book brings together scientists, researchers, practitioners, and students from academia and industry to present recent and ongoing research activities concerning the latest advances, techniques, and applications of natural language processing systems, and to promote the exchange of new ideas and lessons learned. Taken together, the chapters of this book provide a collection of high-quality research works that address broad challenges in both theoretical and applied aspects of intelligent natural language processing. The book presents the state-of-the-art in research on natural language processing, computational linguistics, applied Arabic linguistics and related areas. New trends in natural language processing systems are rapidly emerging – and finding application in various domains including education, travel and tourism, and healthcare, among others. Many issues encountered during the development of these applications can be resolved by incorporating language technology solutions. The topics covered by the book include: Character and Speech Recognition; Morphological, Syntactic, and Semantic Processing; Information Extraction; Information Retrieval and Question Answering; Text Classification and Text Mining; Text Summarization; Sentiment Analysis; Machine Translation Building and Evaluating Linguistic Resources; and Intelligent Language Tutoring Systems.

Neural Network Programming with Java

Author: Fabio M. Soares,Alan M. F. Souza

Publisher: Packt Publishing Ltd

ISBN: 1787122972

Category: Computers

Page: 270

View: 9754

DOWNLOAD NOW »
Create and unleash the power of neural networks by implementing professional Java code About This Book Learn to build amazing projects using neural networks including forecasting the weather and pattern recognition Explore the Java multi-platform feature to run your personal neural networks everywhere This step-by-step guide will help you solve real-world problems and links neural network theory to their application Who This Book Is For This book is for Java developers who want to know how to develop smarter applications using the power of neural networks. Those who deal with a lot of complex data and want to use it efficiently in their day-to-day apps will find this book quite useful. Some basic experience with statistical computations is expected. What You Will Learn Develop an understanding of neural networks and how they can be fitted Explore the learning process of neural networks Build neural network applications with Java using hands-on examples Discover the power of neural network's unsupervised learning process to extract the intrinsic knowledge hidden behind the data Apply the code generated in practical examples, including weather forecasting and pattern recognition Understand how to make the best choice of learning parameters to ensure you have a more effective application Select and split data sets into training, test, and validation, and explore validation strategies In Detail Want to discover the current state-of-art in the field of neural networks that will let you understand and design new strategies to apply to more complex problems? This book takes you on a complete walkthrough of the process of developing basic to advanced practical examples based on neural networks with Java, giving you everything you need to stand out. You will first learn the basics of neural networks and their process of learning. We then focus on what Perceptrons are and their features. Next, you will implement self-organizing maps using practical examples. Further on, you will learn about some of the applications that are presented in this book such as weather forecasting, disease diagnosis, customer profiling, generalization, extreme machine learning, and characters recognition (OCR). Finally, you will learn methods to optimize and adapt neural networks in real time. All the examples generated in the book are provided in the form of illustrative source code, which merges object-oriented programming (OOP) concepts and neural network features to enhance your learning experience. Style and approach This book takes you on a steady learning curve, teaching you the important concepts while being rich in examples. You'll be able to relate to the examples in the book while implementing neural networks in your day-to-day applications.

Decision Making and Soft Computing

Proceedings of the 11th International FLINS Conference

Author: Ronei Marcos de Moraes,Etienne E Kerre,Liliane dos Santos Machado,Jie Lu

Publisher: World Scientific

ISBN: 9814623008

Category: Computers

Page: 780

View: 3930

DOWNLOAD NOW »
FLINS, originally an acronym for Fuzzy Logic and Intelligent Technologies in Nuclear Science, is now extended to Computational Intelligence for applied research. The contributions to the 11th of FLINS conference cover state-of-the-art research, development, and technology for computational intelligence systems, both from the foundations and the applications points-of-view. Contents:Invited Lectures:The Contribution of Fuzzy Sets to Decision Sciences (D Dubois)Granular Fuzzy Systems: A New Direction in Soft Computing and Human Centric Decision-Making (Witold Pedrycz)Some Approaches Towards Lattice Computing in Mathematical Morphology and Computational Intelligence (Peter Sussner)Decision Making and Decision Support SystemsStatistics, Data Analysis and Data MiningFoundations of Computational IntelligenceSoft Computing and Applied ResearchIntelligent Systems and Knowledge EngineeringUncertainty ModelingIntelligent Information Processing Readership: Graduate students, researchers, and academics in artificial intelligence/machine learning, information management, decision sciences, databases/information sciences and fuzzy logic. Keywords:FLINS 2014;Soft Computing;Knowledge Engineering;Decision Making

Planning Algorithms

Author: Steven M. LaValle

Publisher: Cambridge University Press

ISBN: 1139455176

Category: Computers

Page: N.A

View: 8726

DOWNLOAD NOW »
Planning algorithms are impacting technical disciplines and industries around the world, including robotics, computer-aided design, manufacturing, computer graphics, aerospace applications, drug design, and protein folding. This coherent and comprehensive book unifies material from several sources, including robotics, control theory, artificial intelligence, and algorithms. The treatment is centered on robot motion planning, but integrates material on planning in discrete spaces. A major part of the book is devoted to planning under uncertainty, including decision theory, Markov decision processes, and information spaces, which are the 'configuration spaces' of all sensor-based planning problems. The last part of the book delves into planning under differential constraints that arise when automating the motions of virtually any mechanical system. This text and reference is intended for students, engineers, and researchers in robotics, artificial intelligence, and control theory as well as computer graphics, algorithms, and computational biology.

A Field Guide to Genetic Programming

Author: N.A

Publisher: Lulu.com

ISBN: 1409200736

Category: Computers

Page: 233

View: 2081

DOWNLOAD NOW »
Genetic programming (GP) is a systematic, domain-independent method for getting computers to solve problems automatically starting from a high-level statement of what needs to be done. Using ideas from natural evolution, GP starts from an ooze of random computer programs, and progressively refines them through processes of mutation and sexual recombination, until high-fitness solutions emerge. All this without the user having to know or specify the form or structure of solutions in advance. GP has generated a plethora of human-competitive results and applications, including novel scientific discoveries and patentable inventions. This unique overview of this exciting technique is written by three of the most active scientists in GP. See www.gp-field-guide.org.uk for more information on the book.

IBM PowerAI: Deep Learning Unleashed on IBM Power Systems Servers

Author: Dino Quintero,Bing He,Bruno C. Faria,Alfonso Jara,Chris Parsons,Shota Tsukamoto,Richard Wale,IBM Redbooks

Publisher: IBM Redbooks

ISBN: 0738442941

Category: Computers

Page: 278

View: 7758

DOWNLOAD NOW »
This IBM® Redbooks® publication is a guide about the IBM PowerAI Deep Learning solution. This book provides an introduction to artificial intelligence (AI) and deep learning (DL), IBM PowerAI, and components of IBM PowerAI, deploying IBM PowerAI, guidelines for working with data and creating models, an introduction to IBM SpectrumTM Conductor Deep Learning Impact (DLI), and case scenarios. IBM PowerAI started as a package of software distributions of many of the major DL software frameworks for model training, such as TensorFlow, Caffe, Torch, Theano, and the associated libraries, such as CUDA Deep Neural Network (cuDNN). The IBM PowerAI software is optimized for performance by using the IBM Power SystemsTM servers that are integrated with NVLink. The AI stack foundation starts with servers with accelerators. graphical processing unit (GPU) accelerators are well-suited for the compute-intensive nature of DL training, and servers with the highest CPU to GPU bandwidth, such as IBM Power Systems servers, enable the high-performance data transfer that is required for larger and more complex DL models. This publication targets technical readers, including developers, IT specialists, systems architects, brand specialist, sales team, and anyone looking for a guide about how to understand the IBM PowerAI Deep Learning architecture, framework configuration, application and workload configuration, and user infrastructure.

Frontiers in Massive Data Analysis

Author: National Research Council,Division on Engineering and Physical Sciences,Board on Mathematical Sciences and Their Applications,Committee on Applied and Theoretical Statistics,Committee on the Analysis of Massive Data

Publisher: National Academies Press

ISBN: 0309287812

Category: Mathematics

Page: 190

View: 5926

DOWNLOAD NOW »
Data mining of massive data sets is transforming the way we think about crisis response, marketing, entertainment, cybersecurity and national intelligence. Collections of documents, images, videos, and networks are being thought of not merely as bit strings to be stored, indexed, and retrieved, but as potential sources of discovery and knowledge, requiring sophisticated analysis techniques that go far beyond classical indexing and keyword counting, aiming to find relational and semantic interpretations of the phenomena underlying the data. Frontiers in Massive Data Analysis examines the frontier of analyzing massive amounts of data, whether in a static database or streaming through a system. Data at that scale--terabytes and petabytes--is increasingly common in science (e.g., particle physics, remote sensing, genomics), Internet commerce, business analytics, national security, communications, and elsewhere. The tools that work to infer knowledge from data at smaller scales do not necessarily work, or work well, at such massive scale. New tools, skills, and approaches are necessary, and this report identifies many of them, plus promising research directions to explore. Frontiers in Massive Data Analysis discusses pitfalls in trying to infer knowledge from massive data, and it characterizes seven major classes of computation that are common in the analysis of massive data. Overall, this report illustrates the cross-disciplinary knowledge--from computer science, statistics, machine learning, and application disciplines--that must be brought to bear to make useful inferences from massive data.