My Account Log in

1 option

Causal Inference in R : Decipher Complex Relationships with Advanced R Techniques for Data-Driven Decision-making / Subhajit Das.

O'Reilly Online Learning: Academic/Public Library Edition Available online

View online
Format:
Book
Author/Creator:
Das, Subhajit, author.
Language:
English
Subjects (All):
R (Computer program language).
Causation--Statistical methods.
Causation.
Regression analysis.
Decision making--Statistical methods.
Decision making.
Physical Description:
1 online resource (382 pages)
Edition:
First edition.
Place of Publication:
Birmingham, England : Packt Publishing Ltd., [2024]
System Details:
Mode of access: World Wide Web.
Biography/History:
Das Subhajit: Subhajit Das holds a PhD in computer science from Georgia Institute of Technology, USA, specializing in machine learning (ML) and visual analytics. With 10+ years of experience, he is an expert in causal inference, revealing complex relationships and data-driven decision-making. His work has influenced millions in AI, e-commerce, logistics, and 3D software sectors. He has collaborated with leading companies, such as Amazon, Microsoft, Bosch, UPS, 3M, and Autodesk, creating solutions that seamlessly integrate causal reasoning and ML. His research, published in top conferences, focuses on developing AI-powered interactive systems for domain experts. He also holds a master's degree in design computing from the University of Pennsylvania, USA.
Summary:
Determining causality in data is difficult due to confounding factors. Written by an applied scientist specializing in causal inference with over a decade of experience, Causal Inference in R provides the tools and methods you need to accurately establish causal relationships, improving data-driven decision-making. This book helps you get to grips with foundational concepts, offering a clear understanding of causal models and their relevance in data analysis. You’ll progress through chapters that blend theory with hands-on examples, illustrating how to apply advanced statistical methods to real-world scenarios. You’ll discover techniques for establishing causality, from classic approaches to contemporary methods, such as propensity score matching and instrumental variables. Each chapter is enriched with detailed case studies and R code snippets, enabling you to implement concepts immediately. Beyond technical skills, this book also emphasizes critical thinking in data analysis to empower you to make informed, data-driven decisions. The chapters enable you to harness the power of causal inference in R to uncover deeper insights from data. By the end of this book, you’ll be able to confidently establish causal relationships and make data-driven decisions with precision.
Contents:
Cover
Title Page
Copyright and Credits
Dedicated
Contributors
Table of Contents
Preface
Part 1: Foundations of Causal Inference
Chapter 1: Introducing Causal Inference
Defining causal inference
Historical perspective on causal inference
Why do we need causality?
Is it an association or really causation?
Deep dive causality in real-life settings
Exploring the technical aspects of causality
Simpson's paradox
Defining variables
Summary
References
Chapter 2: Unraveling Confounding and Associations
A deep dive into associations
Causality and a fundamental issue
Individual treatment effect
Average treatment effect
The distinction between confounding and associations
Discussing the concept of bias in causality
Assumptions in causal inference
Strategies to address confounding
Regression adjustment
Propensity score methods
Chapter 3: Initiating R with a Basic Causal Inference Example
Technical requirements
What is R? Why use R for causal inference?
Getting started with R
Setting up the R environment
Navigating the RStudio interface
Basic R programming concepts
Data types in R
Advanced data structures
Packages in R
Preparing for causal inference in R
Preparing and loading data
Exploratory data analysis (EDA)
Simple causal inference techniques
Comparing means (t-tests)
Regression analysis
Propensity score matching
Case study - a basic causal analysis in R
Data preparation and inspection
Understanding the data
Performing causal analysis
Part 2: Practical Applications and Core Methods
Chapter 4: Constructing Causality Models with Graphs
Basics of graph theory
Types of graphs - directed versus undirected.
Other graph typologies
Why we need DAGs in causal science
Graph representations of variables
Mathematical interpretation
Representing graphs in R
Bayesian networks
Conditional independence
Exploring Graphical Causal Models
Comparison with Bayesian networks
Assumptions in GCMs
Case study example of a graph model in R
Problem to solve using graphs
Implementing in R
Interpreting results
Chapter 5: Navigating Causal Inference through Directed Acyclic Graphs
Understanding the flow in Graphs
Chains and forks
Colliders
Adjusting for confounding in graphs
D-separation
Do-operator
The back door adjustment
The front door adjustment
Practical R example - back door versus front door
Synthetic data
Back door adjustment in R
Front door adjustment in R
Chapter 6: Employing Propensity Score Techniques
Introduction to propensity scores
A deep dive into these scores
Balancing confounding variables
Check for confounding using propensity scores
Challenges and caveats
Stratification and subsampling
Theory
Application of propensity scores in R
Understanding Propensity Score Matching
Considerations and limitations
Practical application of PSM in R
Balancing methods
Sensitivity analysis
Visualizing the results
Weighting in PSM using R
Chapter 7: Employing Regression Approaches for Causal Inference
Role of regression in causality
Choosing the appropriate regression model
Understanding the nature of the outcome variable
Consideration of confounding and interaction effects
Model complexity, parsimony, and assumptions
Linear regression for causal inference
The theory.
Application of regression modeling in R
Single versus multivariate regression
Treatment orthogonalization
Example of the FWL theorem
Model diagnostics and assumptions
Non-linear regression for causal inference
Other types of non-linear models
Application of a non-linear regression problem in R
Important considerations in regression modeling
Which covariates to consider in the model?
Dummy variables? What are they?
Orthogonalization effect in R
Chapter 8: Executing A/B Testing and Controlled Experiments
Designing and conducting A/B tests
Concepts
Planning your A/B test
Implementation details
Controlled experiments and causal inference
Enhancing causal inference
Beyond A/B testing - multi-armed bandit tests and factorial designs
Ethical considerations
Common pitfalls and challenges
Strategies for dealing with incomplete data
Mitigating spill-over effects
Adaptive experimentation - when and how to adjust your experiment
Implementing A/B test analysis in R
Step 1 - Generating synthetic data
Step 2 - Exploratory data analysis (EDA)
Step 3 - Statistical testing
Step 4 - Multivariate analysis
Step 5 - Interpreting results
Step 6 - Checking assumptions of the t-test
Step 7 - Effect-size calculation
Step 8 - Power analysis
Step 9 - Post-hoc analyses
Step 10 - Visualizing interaction effects
Chapter 9: Implementing Doubly Robust Estimation
What is doubly robust estimation?
An overview of DR estimation
Technique behind DR
Comparison with other estimation methods
Implementing doubly robust estimation in R
Preparing data for DR analysis
Implementing basic DR estimators
Calculating weight
Crafting the DR estimator.
Discussing doubly robust methods
Estimating variance
Advanced DR techniques (using the tmle and SuperLearner packages)
Balancing flexibility and reliability with DR estimation
Part 3: Advanced Topics and Cutting-Edge Methods
Chapter 10: Analyzing Instrumental Variables
Introduction to instrumental variables
The concept of instrumental variables
The importance of instrumental variables in causal inference
Criteria for instrumental variables
Relevance of the instrumental variable
Exogeneity of the instrumental variable
Exclusion restriction
Strategies for identifying valid instrumental variables
Relevance condition
Exogeneity condition
Demonstrating instrumental variable analysis in R
Using gmm for generalized method of moments
Diagnostics and tests in instrumental variable analysis
Interpretation of results
Challenges and limitations of instrumental variable analysis
Weak instrumental variables
Measurement errors in instrumental variables
Interpretation of instrumental variable estimates
Chapter 11: Investigating Mediation Analysis
What is mediation analysis?
Definition and overview
The importance of mediation analysis
Identifying mediation effects
Criteria for mediation
Testing for mediation
Mediation analysis in R
Preparing data for mediation analysis
Conducting mediation analysis
Interpretation and further steps
Advanced mediation models
Chapter 12: Exploring Sensitivity Analysis
Introduction to sensitivity analysis
Why do we need sensitivity analysis?
Historical context
Sensitivity analysis for causal inference.
How do we use sensitivity analysis?
Types of sensitivity analysis
Key concepts and measures
Implementing sensitivity analysis in R
Using R for sensitivity analysis
Visualizing our findings
Case study
Practical guidelines for conducting sensitivity analysis
Choosing parameters for sensitivity analysis
Limitations and challenges
Advanced topics in sensitivity analysis
Venturing beyond binary treatment
ML approaches
Future directions
Chapter 13: Scrutinizing Heterogeneity in Causal Inference
What is heterogeneity?
Definition of heterogeneity in causality
Case studies and discussion
Examples (more of them)
Understanding the types of heterogeneity
Pre-treatment heterogeneity
Post-treatment heterogeneity
Contextual heterogeneity
Heterogeneous causal effects deep dive
Interaction terms in regression models
Subgroup analysis
ML techniques
Estimation methods for identifying HCEs
Regression Discontinuity Designs
Instrumental variables
Propensity Score Matching
Case study - Heterogeneity in R
Generating synthetic data
Exploratory data analysis
Matching for causal inference
Estimating the ATE
Tailoring interventions to different groups
Conceptual framework
Case study 1 - Educational interventions and their varied effects on different student demographics
Case study 2 - Public health campaigns and their differential impacts on various population segments
Chapter 14: Harnessing Causal Forests and Machine Learning Methods
Introduction to causal forests for causal inference
Historical development and key researchers
Theoretical foundations of causal forests
Conditions necessary for causal forest applications.
Advantages and limitations.
Notes:
Includes bibliographical references and index.
Description based on publisher supplied metadata and other sources.
Description based on print version record.
Other Format:
Print version: Das, Subhajit Causal Inference in R
ISBN:
9781803238166
OCLC:
1477882560

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account