A CDA teaser, made by our team. Hope you enjoy it!
Monday, May 31, 2010
Wednesday, May 26, 2010
Pentaho reports by email - with steroids

Most people don't know this but there's an hidden gem inside Pentaho 3.5 (and the soon to be released 3.6);
The ability to send a report by email ...
Nothing new so far?
... with all images and resources inline
So no more of being forced to send pdf attachments in boring text or html emails, or the need to have your pentaho server accessible in the www. Pretty cool, hein?
Did you just said you like the pdf attachments?
You can still send your pdf attachments with the html email!
And I'm not done yet! Want to impress your friends of the other company by receiving yesterday's sales report on your smartphone while having breakfast?
Just render a text version of that report
and it'll automagically generate a mime/alternate email message and your email client will chose the best format to display the information to you
You didn't know this was possible? It's all in there. More than one and a half year's ago we developed a bunch of patches to allow this. And why? Because one of our customers wanted this feature. <shamelessplug> And that's what we do, develop cool stuff for our customers</shamelessplug>
We added:
* A new repository implementation was built to allow the generation of a mime-message directly from the jfreereport engine (with Thomas Morgner's help, of course)
* A patch to the reporting plugin to add a new supported output-type 'email'
* Support to the text output in the reporting plugin
* A SimpleEmailComponent POJO that sends the emails
The existing action to send emails is very limiting, so we had to develop a new one. Pentaho guys allowed this to be commited along with the reporting plugin. We all agree that the BI suite need a full-featured email component so this POJO will eventually be moved out of the reporting plugin. But until that happens - hey, let's use it!
nuff said; Want to see this working? We stored on webdetails site an archive of all the necessary files. Unzip it to your pentaho solution folder of choice, see how it works and feel free to play with it!
Have fun!
Monday, May 10, 2010
Pentaho 3.2 Data Integration: Beginner's Guide (not after reading the book)

This book was a total surprise. Though I've known Maria Roldan's work for a while as a very valuable community member - she wrote an excellent kettle tutorial in the wiki and by her participations in Portuguese and Spanish pentaho communities, but didn't know she was working on the book. Which is a very good sign for this very large and active community.
The contents of this book are also surprising. I wasn't expecting to find a chapter called "Working with databases" in #8 of the list. When working with kettle and the rest of the pentaho suite on a daily basis, building and populating large datawarehouses, you almost expect that to be first on the list. (By the way, the datawarehouse subject comes in chapter 12).
Odd? Well, not if you go through the book and think about it for a bit. Pentaho Data Integration (PDI) is not a tool to build datawarehouses and that can be used to other data integration tasks. PDI is an amazing tool to do any kind of data handling, involving datawarehouses or not, involving databases or not. Since it's great for any data handling, if course it's also great for the subset of data warehouse management.
And I think this is the main message Maria Roldan wants to pass. Forget perl hacks, forget homegrown shell scripts, please, don't write yet another stored procedure that will make you (or the poor guy after you) go nuts when it comes the time to support it. Just use kettle. You'll find a lot of screenshots with accurate explanations making this book very lightweight to read. You can even take it while on travel/vacations and you'll be able to understand it even without further experiments.
This book can bring anyone up to date with Kettle very fast. The first chapter is all about installation procedures. Chapters 2 and 10 describe all the concepts everyone needs to know about transformations and jobs. All the others are clearly the result of someone that spent a lot of time working with the tool. There are brief descriptions of a large number of steps used in real world examples and chapters that dedicate entirely to very important subjects: Data validation and error handling (chapter 7), transforming the rowset (chapter 6) and how to connect it all with the appropriate task flows (chapters 10 and 11)
But my favorite chapter is number 5 - The javascript chapter. When most of the experts in kettle will say avoid the javascript step at all costs, Maria dedicated an entire chapter to it. I love javascript and I love the javascript step (that's right Thomas, regardless of your complains). When used right, it can save your day - true, it can also ruin your transformation if you're not cautious. It will definitely slow your transformation down, and if you're an ETL God you'll look for other alternatives. But for almost other situations in the mortals world where we have other bottlenecks to worry about just use this step!
Summing it all up, if you need to take bits and bytes from point A to point B, just buy the book.
ps: I support the home team, and as such I'm biased towards Pentaho just like I'm biased towards Mozilla or my team at WebDetails but I'm not doing that in regards to this review. The proof that I enjoyed this book and all the hard work behind it is that as soon as I saw this twitter message I acted as fast as I could and now Maria Roldan is officially a member of the WebDetails team. Welcome Cari!
Subscribe to:
Posts (Atom)


