By John Mertic, Director of ODPi at The Linux Foundation
ODPi recently hosted a webinar on getting results from BI and Data Science with Cupid Chan, managing partner at 4C Decision, Moon soo Lee, CTO and co-founder of ZEPL and creator of Apache Zeppelin, and Frank McQuillan, director of product management at Pivotal.
During the webinar, we discussed the convergence of traditional BI and Data Science disciplines (machine learning, artificial intelligence… etc), and why statistical/data science models can now run on Hadoop in a much more cost effective manner than a few years ago.
The second part of the webinar focused on demos of Jupyter Notebooks and Apache Zeppelin. These were important and relevant demos, as Data Scientist utilize Jupyter Notebooks the most and Apache Zeppelin supports multiple technologies, multi-languages & environments; making it a great tool for BI.
The inspiration for the webinar was the new Data Science Notebook Guidelines. Created by the ODPi BI and Data Science SIG, the guidelines help bridge the gap so that BI tools can sit harmoniously on top of both Hadoop and RDBMS, while providing the same, or even more, business insight to the BI users who have also Hadoop in the backend. Download Now »
Additionally, webinar listeners asked detailed questions; including:
- How can one transition from a bioinformatics developer to Data scientist in Bio-statistic?
- Where do you see the future of both Jupyter and Zeppelin going? Are there other key data science challenges needing solved by these tools?
- When do you choose to use one notebook over the other?
- Can the 2 notebooks be used together? i.e., can you create a Jupyter notebook and save it, then upload it into Zeppelin (or vice versa)?
Overall, the webinar was an insightful discussion on how we can achieve big data ecosystem integration in a collaborative way