6 options
Pentaho 3.2 data integration : beginner's guide : explore, transform, validate, and integrate your data with ease / María Carina Roldán.
- Format:
- Book
- Author/Creator:
- Roldán, María Carina.
- Language:
- English
- Subjects (All):
- Data integration (Computer science).
- Physical Description:
- 1 online resource (493 p.)
- Edition:
- 1st edition
- Place of Publication:
- Birmingham, U.K. : Packt Pub., 2010.
- Language Note:
- English
- System Details:
- text file
- Summary:
- Explore, transform, validate, and integrate your data with ease
- Contents:
- Cover; Copyright; Credits; Foreword; The Kettle Project; About the Author; About the Reviewers; Table of Contents; Preface; Chapter 1: Getting started with Pentaho Data Integration; Pentaho Data Integration and Pentaho BI Suite; Exploring the Pentaho Demo; Pentaho Data Integration; Using PDI in real world scenarios; Loading data warehouses or data marts; Integrating data; Data cleansing; Migrating information; Exporting data; Integrating PDI using Pentaho BI; Installing PDI; Time for action - installing PDI; Launching the PDI graphical designer: Spoon
- Time for action - starting and customizing SpoonSpoon; Setting preferences in the Options window; Storing transformations and jobs in a repository; Creating your first transformation; Time for action - creating a hello world transformation; Directing the Kettle engine with transformations; Exploring the Spoon interface; Running and previewing the transformation; Time for action - running and previewing the hello_world; transformation; Installing MySQL; Time for action - installing MySQL on Windows; Time for action - installing MySQL on Ubuntu; Summary
- Chapter 2: Getting Started with TransformationsReading data from files; Time for action - reading results of football matches from files; Input files; Input steps; Reading several files at once; Time for action - reading all your files at a time using a single; Text file input step; Time for action - reading all your files at a time using a single; Text file input step and regular expressions; Regular expressions; Grids; Sending data to files; Time for action - sending the results of matches to a plain file; Output files; Output steps; Some data definitions; Rowset; Streams
- The Select values stepGetting system information; Time for action - updating a file with news about examinations; Getting information by using Get System Info step; Data types; Date fields; Numeric fields; Running transformations from a terminal window; Time for action - running the examination transformation from; a terminal window; XML files; Time for action - getting data from an XML file with information; about countries; What is XML; PDI transformation files; Getting data from XML files; XPath; Configuring the Get data from XML step; Kettle variables; How and when you can use variables
- SummaryChapter 3: Basic data manipulation; Basic calculations; Time for action - reviewing examinations by using the; Calculator step; Adding or modifying fields by using different PDI steps; The Calculator step; The Formula step; Time for action - reviewing examinations by using the; Formula step; Calculations on groups of rows; Time for action - calculating World Cup statistics by; grouping data; Group by step; Filtering; Time for action - counting frequent words by filtering; Filtering rows using the Filter rows step; Looking up data
- Time for action - finding out which language people speak
- Notes:
- Includes index.
- ISBN:
- 9786612557361
- 9781282557369
- 128255736X
- 9781847199553
- 1847199550
- OCLC:
- 796383966
The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.