My Account Log in

1 option

Using Amazon Bedrock : learn to architect, secure and optimize generative AI applications on AWS / Renaldi Gondosubroto.

O'Reilly Online Learning: Academic/Public Library Edition Available online

View online
Format:
Book
Author/Creator:
Gondosubroto, Renaldi, author.
Series:
Tech today.
Tech today
Language:
English
Subjects (All):
Amazon Web Services (Firm).
Artificial intelligence.
Web applications--Development.
Web applications.
Physical Description:
1 online resource (483 pages).
Edition:
1st ed.
Place of Publication:
Newark : John Wiley & Sons, Incorporated, [2026]
Summary:
A from-scratch roadmap to building generative AI solutions on AWS with Amazon Bedrock In Using Amazon Bedrock: Learn to Architect, Secure and Optimize Generative AI Applications on AWS , accomplished Software Engineer, developer advocate, and AWS Community Builder, Renaldi Gondosubroto, delivers an in-depth walkthrough of Amazon Bedrock.
Contents:
Introduction
What Does This Book Cover?
Who Should Read This Book
Part 1 Introduction to Your New Generative AI Playground
Chapter 1 Introduction to Generative AI on AWS
The Current Generative AI Landscape
Introduction to Foundational Models
Core Technologies
Key Attributes
Transfer Learning
Generalization and Adaptation
Creativity
Emergent Behavior
Interoperability
Efficiency Challenges
Recent Advances in Generative AI
Industries Using Solutions with Amazon Bedrock
Healthcare
Retail
Finance
Advertising
Tasks Performed by Generative AI
Summarization
Software Development
Question Answering and Knowledge Retrieval
Chatbot Enhancement
Generative Design
Ethics and Security
Ethics
Security
Technical Challenges
How Amazon Bedrock Approaches AI Challenges
Fairness
Explainability
Privacy and Security
Robustness
Governance
Transparency
Life Cycle of a Generative AI Solution
Problem Definition and Scope
Data Collection and Preparation
Model Selection and Tuning
Training and Evaluation
Integration and Deployment
Monitoring, Security, and Compliance
Feedback and Improvement
Tech Setup for Using Generative AI on AWS
Python for Developing Generative AI Applications
Understanding How Bedrock Works
Understanding Pricing in Bedrock
Amazon Bedrock Capabilities
Forms of Generation
Text
Images
Embeddings
A One-Stop Solution
Amazon Nova
DeepSeek
Claude
Stable Diffusion
Llama
Command and Embed
Mistral
Creating Knowledge Bases
Fine-Tuning
Agents
Safeguards
Spaces for Experimentation
Amazon Bedrock Playground
PartyRock.
Working with Other AWS Services
Unified Data Access
Security and Authorization
Monitoring and Logging
AI Assistants
A Customer Service Chatbot Solution with Bedrock
Summary
Chapter 2 Prompt Engineering with Foundational Models on AWS
Crafting Good Prompts
Best Practices in Crafting Good Prompts
Clarity and Specificity
Context and Background
Iteration and Feedback
Break the Tasks into Steps
Testing and Experimentation
Advanced Techniques
Chain of Thought Reasoning
Meta Prompting
Templating
Using the Model Context Protocol for Dynamic Prompting
Tackling Malintent in Prompts
Prompt Injection
Jailbreak Attempts
Hallucinations
Evaluating Prompt Performance
Learning About Zero-, One- and Few-Shot Inference
Zero-Shot Inference
One-Shot Inference
Few-Shot Inference
Configuring Your Bedrock Environment
Working with Parameters for Foundational Models
Controlling Randomness and Diversity of Responses
Temperature
Top P (Nucleus Sampling)
Top K
Controlling Length
Max Tokens
Stop Tokens
Repetition Penalty
Cost Optimization Based on Prompting
Experimenting with Playgrounds Provided by AWS
PartyRock
Building Your First PartyRock App
Configuring Your Generated PartyRock App
Practical Exercise: Finding the Right Prompt
Part 2 Core Generative AI on AWS
Chapter 3 Building Applications with the Amazon Bedrock API
Overview of Using the Amazon Bedrock API
API Endpoints for Using Bedrock
Invoking the Model
Using Textual Models Through the API
Configuring the Appropriate Parameters
Inference Parameters
Troubleshooting the Parameters
Formatting the Right API Calls
Streaming Responses
Handling Exceptions
Prompt Caching for Cost and Latency Optimization.
Cost Optimization and Architectural Best Practices
Using Stable Diffusion for Image Generation
Using the Bedrock Image Playground
Coding Text-to-Image Implementation
Maintaining Context in Generative AI Chatbots
Understanding the Requirements of a Generative AI Web App
Architecting the Application Concept and Functionality
Choose Your Technology Stack
Set Up the Backend
Integrate the AI Model
Develop the Front-End
Handle the User Input and Display AI Responses
Test and Iterative Improvements for Model Performance
Deploy
Building Your First Generative AI Flask Application
Chapter 4 Working with Multimodal Foundational Models
Multimodal Foundational Models
Creating Prompts for Multimodal Foundational Models
Text and Multimodal Foundational Models: Key Differences
Integration of Multiple Inputs
Rich Contextual Information
Visual Detail Specificity
Dealing with Ambiguity
Iterative Feedback
Best Practices for Working with Multimodal Foundational Models
Well-Crafted Prompts
Balanced Detail
Cultural and Societal Sensitivity
Syntactic Consistency
Iterative Testing and Refinement
Leveraging Model Strengths
Encouraging Creativity and Diversity
Applying Model Architecture and Training Data in Prompts
Enhancing Context with Multimodal Foundational Models
Image-to-Image Transformation
Image Inpainting
Image Outpainting
Visual Question Answering
Image Captioning
Working with Amazon SageMaker JumpStart
Preparation
Configuration of Permissions and Variables
Model Retrieval and Endpoint Deployment
Endpoint Interaction and Response Handling
Practical Exercise: Creating a Movie Recognizer
Data
Search Index
Retrieving Results
Demo Application
Summary.
Chapter 5 Fine-Tuning Foundational Models on AWS
Fine-Tuning Foundational Models
Hyperparameters for Fine-Tuning
Epoch
Batch Size
Learning Rate
Learning Rate Warmup Steps
Putting This into Practice
Instruction-Based Fine-Tuning on AWS
Creating a Dataset for Fine-Tuning from Your Output
Formatting of a Multitask Instruction Dataset
Converting a Dataset to Be Used for Instruction
Fine-Tuning with Instruction on Bedrock
Setting Up for Fine-Tuning
Installing Libraries Necessary to Perform Fine-Tuning
Importing Libraries and Initializing Variables
Creating an IAM Role and Policy for Using Fine-Tuning
Writing the Instruction and Iterating Over Data Points
Defining the data_transform Function and Processing Data Partitions
Converting and Uploading the Datasets for Use in Fine-Tuning
Creating the Fine-Tuning Model Based on Llama 3.3
Using Tokenization
Using fmeval
Fetching Variables from Before and Importing Relevant Libraries
Creating the Fine-Tuning Job
Creating Provisioned Model Throughput
Monitoring the Status of the Deployment of the Model
Invoking the Fine-Tuning Model
Cleaning Up the Resources
Fine-Tuning on Amazon SageMaker JumpStart
Amazon SageMaker JumpStart
Deploying a Llama Model with Amazon SageMaker
Fine-Tuning Using SageMaker Studio
Using SageMaker JumpStart in SageMaker Studio to Fine-Tune
Cleaning Up Endpoints
Advanced Considerations for Fine-Tuning
Chapter 6 Performing Retrieval-Augmented Generation on AWS
Retrieval-Augmented Generation
In-Depth Explanation of RAG
Broad Spectrum of RAG Applications
Advantages and Ethical Considerations
RAG Compared with Other AI Models
Practical Limitations of RAG
Fundamentals of RAG on AWS
Process of Retrieval-Augmented Generation.
Selecting an Appropriate Data Source
Selecting Data
Define the Scope and Requirements
Assess Data Relevance and Diversity
Ensure Data Quality
Consider Data Security and Compliance
Use of Metadata and Annotations
Testing and Iteration
Selecting a Vector Database
Role of Vector Databases in RAG
Practical Application
Selecting the Appropriate Vector Databases
Refining Embeddings for RAG Use in Vaccine Information Retrieval
Selecting an Embeddings Model
Configuring LangChain
Preparing Your Data
Question Answering
Using Agents for Working with Foundational Models
Working with Agents for Amazon Bedrock
Uploading Datasets to Amazon S3
Creating a Knowledge Base
Creating an Agent from the AWS Web Interface
Setting Up a Question-Answering Task
Chapter 7 Optimizing Performance for Foundational Models
The Challenges of Compute and Memory with LLMs
Memory Management
Compute Resources
Inference Optimization
Evaluation and Refining Performance of Foundational Models
Reinforcement Learning
Approaches to Reinforcement Learning for Book Reviews
Learning Loop for Book Reviews in Bedrock
Automatic Model Evaluations
Human Worker-Based Model Evaluation Jobs
Model Evaluation Tasks
Text Generation
Question and Answer
Classification
Creating the Appropriate Prompt Datasets
Built-in Prompt Datasets
Custom Prompt Datasets
Understanding Model Evaluation Job Results
Automated Reports
Human Report Cards
Distributed Computing Approaches to Achieving Optimization
Embracing Data Parallelism
Advancing with Model Parallelism
Harnessing Hybrid Parallelism
Leveraging Elastic Scaling
Implementing Effective Load Balancing
Ensuring Fault Tolerance and Recovery
Using Prompt Caching.
Optimizing Performance with Step Functions and Lambda.
Notes:
Description based on publisher supplied metadata and other sources.
ISBN:
1394382634
9781394382637
1394406703
9781394406708
OCLC:
1547123958

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account