My Account Log in

1 option

R Bioinformatics Cookbook : Utilize R Packages for Bioinformatics, Genomics, Data Science, and Machine Learning / Dan MacLean.

O'Reilly Online Learning: Academic/Public Library Edition Available online

View online
Format:
Book
Author/Creator:
Maclean, Dan, author.
Language:
English
Subjects (All):
Bioinformatics.
R (Computer program language).
Computational biology.
Physical Description:
1 online resource (396 pages)
Edition:
Second edition.
Place of Publication:
Birmingham, England : Packt Publishing Ltd., [2023]
Summary:
The updated second edition of R Bioinformatics Cookbook takes a recipe-based approach to show you how to conduct practical research and analysis in computational biology with R. You’ll learn how to create a useful and modular R working environment, along with loading, cleaning, and analyzing data using the most up-to-date Bioconductor, ggplot2, and tidyverse tools. This book will walk you through the Bioconductor tools necessary for you to understand and carry out protocols in RNA-seq and ChIP-seq, phylogenetics, genomics, gene search, gene annotation, statistical analysis, and sequence analysis. As you advance, you'll find out how to use Quarto to create data-rich reports, presentations, and websites, as well as get a clear understanding of how machine learning techniques can be applied in the bioinformatics domain. The concluding chapters will help you develop proficiency in key skills, such as gene annotation analysis and functional programming in purrr and base R. Finally, you'll discover how to use the latest AI tools, including ChatGPT, to generate, edit, and understand R code and draft workflows for complex analyses. By the end of this book, you'll have gained a solid understanding of the skills and techniques needed to become a bioinformatics specialist and efficiently work with large and complex bioinformatics datasets.
Contents:
Cover
Title Page
Copyright and Credits
Dedications
Contributors
Table of Contents
Preface
Chapter 1: Setting Up Your R Bioinformatics Working Environment
Technical requirements
Further information
Setting up an R project in a directory
Getting ready
How to do it…
How it works…
There's more…
Using the here package to simplify working with paths
Using the devtools package to work with the latest non-CRAN packages
Setting up your machine for the compilation of source packages
See also
Using the renv package to create a project-specific set of packages
Installing and managing different versions of Bioconductor packages in environments
Using bioconda to install external tools
swGetting ready
Chapter 2: Loading, Tidying, and Cleaning Data in the tidyverse
Loading data from files with readr
Tidying a wide format table into a tidy table with tidyr
Tidying a long format table into a tidy table with tidyr
Combining tables using join functions
Reformatting and extracting existing data into new columns using stringr
How it works….
Computing new data columns from existing ones and applying arbitrary functions using mutate()
Using dplyr to summarize data in large tables
Using datapasta to create R objects from cut-and-paste data
Chapter 3: ggplot2 and Extensions for Publication Quality Plots
Combining many plot types in ggplot2
Comparing changes in distributions with ggridges
Customizing plots with ggeasy
Highlighting selected values in busy plots with gghighlight
Plotting variability and confidence intervals better with ggdist
Making interactive plots with plotly
Clarifying label placement with ggrepel
Zooming and making callouts from selected plot sections with facetzoom
Chapter 4: Using Quarto to Make Data-Rich Reports, Presentations, and Websites
Using Markdown and Quarto for literate computation
Creating different document formats from the same source
Creating data-rich presentations from code
Getting ready.
How to do it…
Creating websites from collections of Quarto documents
Adding interactivity with Shiny
Chapter 5: Easily Performing Statistical Tests Using Linear Models
Modeling data with a linear model
Using a linear model to compare the mean of two groups
Using a linear model and ANOVA to compare multiple groups in a single variable
Using linear models and ANOVA to compare multiple groups in multiple variables
Testing and accounting for interactions between variables in linear models
Doing tests for differences in data in two categorical variables
Making predictions using linear models
Chapter 6: Performing Quantitative RNA-seq
Estimating differential expression with edgeR
Estimating differential expression with DESeq2
There's more...
Estimating differential expression with Kallisto and Sleuth
Using Sleuth to analyze time course experiments
Analyzing splice variants with SGSeq
Performing power analysis with powsimR
Finding unannotated transcribed regions
Finding regions showing high expression ab initio using bumphunter
Differential peak analysis
Estimating batch effects with SVA
Finding allele-specific expression with AllelicImbalance
Presenting RNA-Seq data using ComplexHeatmap
Chapter 7: Finding Genetic Variants with HTS Data
Finding SNPs and INDELs from sequence data using VariantTools
Predicting open reading frames in long reference sequences
Plotting features on genetic maps with karyoploteR
Selecting and classifying variants with VariantAnnotation
Extracting information in genomic regions of interest
Finding phenotype and genotype associations with GWAS
Estimating the copy number at a locus of interest
Chapter 8: Searching Gene and Protein Sequences for Domains and Motifs
Technical requirements.
Further information
Finding DNA motifs with universalmotif
Finding protein domains with PFAM and bio3d
Finding InterPro domains
See also…
Finding transmembrane domains with tmhmm and pureseqTM
Creating figures of protein domains using drawProteins
Performing multiple alignments of proteins or genes
Aligning genomic length sequences with DECIPHER
Novel feature detection in proteins
3D structure protein alignment in bio3d
Chapter 9: Phylogenetic Analysis and Visualization
Reading and writing varied tree formats with ape and treeio
Visualizing trees of many genes quickly with ggtree
Quantifying and estimating the differences between trees with treespace
Extracting and working with subtrees using ape
Creating dot plots for alignment visualizations
Reconstructing trees from alignments using phangorn
Finding orthologue candidates using reciprocal BLASTs.
Notes:
Includes index.
Includes bibliographical references and index.
Description based on print version record.
ISBN:
9781837633821
1837633827
OCLC:
1407627208

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Library Catalog Using Articles+ Library Account