#2 in Data warehousing books
Use arrows to jump to the previous/next product
Reddit mentions of The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data
Sentiment score: 4
Reddit mentions: 7
We found 7 Reddit mentions of The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data. Here are the top ones.
Buying options
View on Amazon.comor
- Wiley
Features:
Specs:
Height | 9.200769 Inches |
Length | 7.299198 Inches |
Number of items | 1 |
Weight | 2.00179733896 Pounds |
Width | 1.200785 Inches |
It's hard to provide a full answer just based on available information, but roughly you have many different ways to achieve what you have in mind.
Some families of ways to handle this:
What's the most adequate way to do this depends on various factors:
Finally, here are some tools which can help:
If you are into this for the long term, it can be worth reading a book that I often mention, which is the ETL book by Ralph Kimball. While quite old, it provides interesting patterns.
Happy to detail this more if you provide more input!
The Data Warehouse ETL Toolkit
Dimensional modeling is under valued in today's climate. Any complex models on a large scale will be more effective when modeled optimally.
https://www.amazon.com/Data-WarehouseÂ-ETL-Toolkit-Techniques-Extracting/dp/0764567578
Informatica is widely used for ETL tools, but more important is understanding the challenges. How to design staging tables, what update strategy to use, how to design restartable jobs, and so on. This book refers to data warehousing, but the techniques are applicable to most scenarios.
Also The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data https://www.amazon.com/dp/0764567578/ref=cm_sw_r_cp_apa_lnUSAbDC5NK4X
Beautiful. Thank you. Everything you just described here, falls in the realm of ETL? I just ordered a book on the subject. The logic does make sense explained this way.
http://www.amazon.com/The-Data-Warehouse-ETL-Toolkit/dp/0764567578/ref=sr_1_1?ie=UTF8&qid=1331798466&sr=8-1
Awesome! I kind of fell into the job. I was initially hired as a web developer, and didn't even know what BI was, and then got recruited by one of the BI managers and fell in love. To me, it is one of the few places in IT where what you create will directly impact the choices a business will make.
Most of what I do is ETL work (taking data from multiple systems, and loading them into a single warehouse), with a few cubes (multidimensional data analaysis) and SSRS report models (logical data model built on top of a relational data store used for ad hoc report creation). I also do a bit of report design, and lots of InfoPath 2010 + SharePoint 2010 custom development.
We use the entire Microsoft BI stack here, so SQL Server Integration (SSIS), Analysis (SSAS), and Reporting Services (SSRS). Microsoft is definitely up and coming in the BI world, but you might want to try to familiarize yourself with Oracle BI, Business Objects, or Cognos. Unfortunately, most of these tools are very expensive and not easy to get up and running. I would suggest you familiarize yourself with the concepts, and then you will be able to use any tool to apply them.
For data warehousing, check out the Kimball books:
Here and here and here
For reporting, get good with data visualizations, anything by Few or Tufte, like:
Here and here
For integration, check these out:
Here and here
Also, if you're interested in Microsoft BI (SSIS, SSAS, SSRS) check out this site. It has some awesome videos around SSAS that are easy to follow along with.
Also, check out the MSDN BI Blog: http://blogs.msdn.com/b/bi/
Currently at work, but if you have more questions, feel free to shoot me a message!