Data Mining for Bioinformatics

Sumeet Dua, Pradeep Chowriappa

Hardback
$75.16

eBook
from $42.00

November 6, 2012 by CRC Press
Professional - 348 Pages - 92 B/W Illustrations
ISBN 9780849328015 - CAT# 2801

FREE Standard Shipping!

was $93.95

$75.16

SAVE $18.79

Add to Cart
Add to Wish List

Features

  • Discusses design principles of data mining systems for bioinformatics
  • Examines data mining of protein data, gene expression data, and medical image data
  • Includes data cleansing, translation and transformation, and dimensionality reduction

Summary

Covering theory, algorithms, and methodologies, as well as data mining technologies, Data Mining for Bioinformatics provides a comprehensive discussion of data-intensive computations used in data mining with applications in bioinformatics. It supplies a broad, yet in-depth, overview of the application domains of data mining for bioinformatics to help readers from both biology and computer science backgrounds gain an enhanced understanding of this cross-disciplinary field.

The book offers authoritative coverage of data mining techniques, technologies, and frameworks used for storing, analyzing, and extracting knowledge from large databases in the bioinformatics domains, including genomics and proteomics. It begins by describing the evolution of bioinformatics and highlighting the challenges that can be addressed using data mining techniques. Introducing the various data mining techniques that can be employed in biological databases, the text is organized into four sections:

  1. Supplies a complete overview of the evolution of the field and its intersection with computational learning
  2. Describes the role of data mining in analyzing large biological databases—explaining the breath of the various feature selection and feature extraction techniques that data mining has to offer
  3. Focuses on concepts of unsupervised learning using clustering techniques and its application to large biological data
  4. Covers supervised learning using classification techniques most commonly used in bioinformatics—addressing the need for validation and benchmarking of inferences derived using either clustering or classification

The book describes the various biological databases prominently referred to in bioinformatics and includes a detailed list of the applications of advanced clustering algorithms used in bioinformatics. Highlighting the challenges encountered during the application of classification on biological databases, it considers systems of both single and ensemble classifiers and shares effort-saving tips for model selection and performance estimation strategies.