Pentaho Kettle solutions : building open source ETL solutions with Pentaho data integration / Matt Casters, Roland Bouman, Jos van Dongen.

Por: Casters, MattColaborador(es): Bouman, Roland [coautor] | Dongen, Jos van [coautor]Tipo de material: TextoTextoDetalles de publicación: Indianapolis, IN : Wiley Publishing, 2010 Descripción: XLV, 674 páginas : ilustraciones ; 24 cmISBN: 9780470635179Tema(s): Pentaho (Programa de ordenador)Resumen: This book describes Kettle and how it can be implemented, applied and managed, including an extensive collection of use cases and best practices. A major part of the book will be based on Kimball's 34 ETL subsystems. (Note that the book does not assume prior Kettle or ETL knowlegde which makes it an ideal start for anyone wanting to learn an ETL tool.) The book will cover all distinct components that make up the Kettle product and shows how they can be applied toreal-world scenarios. The book uses a solutions-oriented approach, meaning that the available toolset is not discussed from the tool perspective but from the solution perspective (i.e. what someone can accomplish using the product). The first half of the book (parts 1, 2 and 3) is devoted to the basic Kettle functionality and how it can be applied to get ETL solutions up and running. Parts 2 and 3 follow the '34 ETL subsystems' as described by Ralph Kimball. The 34 subsystems cover the entire ETL lifecycle and make for an excellent guideline to cover all parts of data warehousing with Kettle. The second half of the book (parts 4, 5 and 6) cover more advanced or specialized topics like clustering, extensibility and loading a data vault model. For every subject a real life example will be used that people can easily relate to, but due to the diverse nature of the different chapters there won't be an overall case to illustrate the concepts by. The variety of examples will also ensure a more lively discussion of the different topics. The book and the samples in it cover everything from simple single table data migration to complex multi system clustered data integration tasks. When people have read this book they will have learned the following: What ETL and data integration is, and why they need it The components that form the Kettle ETL tool set (and hows these components fulfill particular data integration needs) How to install and configure Kettle, and how to connect it to various data sources and targets. How to design and build every aspect of an ETL solution using Kettle How to build and load a data warehouse with Kettle How to deploy and schedule ETL solutions How to integrate and extend Kettle How to run and scale Kettle solutions using a distributed 'cloud'environment
Etiquetas de esta biblioteca: No hay etiquetas de esta biblioteca para este título. Inicie sesión para agregar etiquetas.
Valoración
    Valoración media: 0.0 (0 votos)
Existencias
Tipo de ítem Biblioteca de origen Signatura Estado Fecha de vencimiento Código de barras Reserva de ítems Bibliografía recomendada
Manuales 03. BIBLIOTECA INGENIERÍA PUERTO REAL
681.3.06PEN/CAS/pen (Navegar estantería(Abre debajo)) Disponible   Ubicación en estantería | Bibliomaps® 3745312270

TECNOLOGÍAS DE INTELIGENCIA DE NEGOCIO GRADO EN INGENIERÍA INFORMÁTICA Asignatura actualizada 2023-2024

Total de reservas: 0

This book describes Kettle and how it can be implemented, applied and managed, including an extensive collection of use cases and best practices. A major part of the book will be based on Kimball's 34 ETL subsystems. (Note that the book does not assume prior Kettle or ETL knowlegde which makes it an ideal start for anyone wanting to learn an ETL tool.) The book will cover all distinct components that make up the Kettle product and shows how they can be applied toreal-world scenarios. The book uses a solutions-oriented approach, meaning that the available toolset is not discussed from the tool perspective but from the solution perspective (i.e. what someone can accomplish using the product). The first half of the book (parts 1, 2 and 3) is devoted to the basic Kettle functionality and how it can be applied to get ETL solutions up and running. Parts 2 and 3 follow the '34 ETL subsystems' as described by Ralph Kimball. The 34 subsystems cover the entire ETL lifecycle and make for an excellent guideline to cover all parts of data warehousing with Kettle. The second half of the book (parts 4, 5 and 6) cover more advanced or specialized topics like clustering, extensibility and loading a data vault model. For every subject a real life example will be used that people can easily relate to, but due to the diverse nature of the different chapters there won't be an overall case to illustrate the concepts by. The variety of examples will also ensure a more lively discussion of the different topics. The book and the samples in it cover everything from simple single table data migration to complex multi system clustered data integration tasks. When people have read this book they will have learned the following: What ETL and data integration is, and why they need it The components that form the Kettle ETL tool set (and hows these components fulfill particular data integration needs) How to install and configure Kettle, and how to connect it to various data sources and targets. How to design and build every aspect of an ETL solution using Kettle How to build and load a data warehouse with Kettle How to deploy and schedule ETL solutions How to integrate and extend Kettle How to run and scale Kettle solutions using a distributed 'cloud'environment

No hay comentarios en este titulo.

para aportar su opinión.

Con tecnología Koha