Last week I managed to get my hands on the training material for the 2 Day Oracle Data Miner course. This course is run by Oracle University.
Many thanks to Michael O’Callaghan, a BI salesperson here in Ireland, and to Oracle University for arranging this.
The 2 days are pretty packed with a mixture of lecture-type material, lots of hands-on exercises and some time for open discussions. In particular, day 2 will be a very busy day.
Check out the course outline and published schedule – click here
You can have this course on site at your organisation. If this is something that interests you then contact your Oracle University account manager. There is also the traditional face-to-face delivery and the newer online delivery, where people from around the world come together for the online class.
There are a number of sessions on Oracle Advanced Analytics and related topics taking place this week at COLLABORATE12 in Las Vegas (http://collaborate12.com).
|Day||Time||Session||Presenter|
|Sun 22nd||9:00-15:00||Oracle Business Intelligence Application Journey|| |
|Mon 23rd||9:45-10:45||Managing Unstructured Data using Hadoop, Oracle 11g and Oracle Exadata Database Machine||Jim Steiner|
|Mon 23rd||9:45-10:45||Environmental Data Management and Analytics-a Real World Perspective||Angela Miller|
|Mon 23rd||11:00-12:00||Public Safety and Environmental Real-Time Analytics using Oracle Business Intelligence||Raghav Venkat|
|Mon 23rd||11:00-12:00||BI is more than slice and dice||Peter Scott|
|Mon 23rd||14:30-15:30||In-Database Analytics: Predictive Analytics, Data Mining, Exadata & Business Intelligence||Jacek Myczkowski|
|Mon 23rd||15:45-16:45||Big Data Analytics, R you ready||Mark Hornick|
|Tues 24th||10:45-11:45||BI Analytics and Oracle NoSQL. The Future of Now||Manish Khera|
|Wed 25th||8:15-9:15||Oracle Data Mining – A Component of the Oracle Advanced Analytics Option-Hands-on Lab||Charlie Berger|
|Wed 25th||9:30-10:30||Oracle R Enterprise – A Component of the Oracle Advanced Analytics Option-Hands-on Lab||Mark Hornick|
Here are the abstracts from the two main Oracle Advanced Analytics presentations, by Charlie Berger and Mark Hornick.
Oracle Data Mining – A Component of the Oracle Advanced Analytics Option
This Hands-on Lab provides an introduction to Oracle Data Mining and the Oracle Data Miner GUI.
Oracle Data Mining (ODM), now part of Oracle Advanced Analytics, provides an extensive set of in-database data mining algorithms that solve a wide range of business problems. It can predict customer behavior, detect fraud, analyze market baskets, segment customers, and mine text to extract sentiments. ODM provides powerful data mining algorithms that run as native SQL functions for in-database model building and model deployment. There is no need for the time delays and security risks of data movement.
The free Oracle Data Miner GUI is an extension to Oracle SQL Developer 3.1 that enables data analysts to work directly with data inside the database, explore the data graphically, build and evaluate multiple data mining models, apply ODM models to new data, and deploy ODM’s predictions and insights throughout the enterprise. Oracle Data Miner work flows capture and document the user’s analytical methodology and can be saved and shared with others to automate advanced analytical methodologies.
Oracle R Enterprise – A Component of the Oracle Advanced Analytics Option
This Hands-on Lab provides an introduction to Oracle R Enterprise.
Oracle R Enterprise, a part of the Oracle Advanced Analytics Option, makes the open source R statistical programming language and environment ready for the enterprise by integrating R with Oracle Database. R users can interactively and transparently execute R scripts for statistical and graphical analyses on data stored in Oracle Database. R scripts can be executed in Oracle Database using potentially multiple database-managed R engines – resulting in data parallel execution. ORE also provides a rich set of statistical functions and advanced analytics techniques.
In this lab, attendees will be introduced to Oracle’s strategy for R, including the Oracle R Distribution, Oracle R Enterprise (ORE), and Oracle R Connector for Hadoop (ORCH). We will focus on Oracle R Enterprise with hands-on exercises exploring the transparency layer, embedded R execution, and statistics engine.
Here is a selection of videos and websites on Data Visualisations.
Hans Rosling videos of his TED talks
- World Population Growth
- Global Population Growth (TED)
- Asia’s Rise – How and When
- HIV: New facts and stunning data visuals
- Video for the BBC
Charlie Berger (Sr. Director Product Management, Data Mining & Advanced Analytics) has produced a video based on a recent presentation called ‘Oracle Advanced Analytics: Oracle R Enterprise & Oracle Data Mining’.
This is a 1 hour video, including some demos, covering product background, product features, recent developments and new additions, examples of how Oracle is including Oracle Data Mining in its Fusion Applications, etc.
Oracle has 2 data mining products: the main in-database Oracle Data Mining, and the more recent extensions to R that give us Oracle R Enterprise.
Check out the video – Click here.
Check out Charlie’s blog at https://blogs.oracle.com/datamining/
Oracle University : 2 Day Oracle Data Mining training course
In a previous blog post I explained what attribute importance is and how it can be used in the Oracle Data Miner tool (click here to see blog post).
In this post I want to show you how to perform the same task using the ODM PL/SQL API.
The ODM tool makes extensive use of the Automatic Data Preparation (ADP) function. ADP performs some data transformations such as binning, normalization and outlier treatment of the data based on the requirements of each of the data mining algorithms. In addition to these transformations we can specify our own transformations. We do this by creating a settings table which will contain the settings and transformations we want the data mining algorithm to perform on the data.
ADP is automatically turned on when using the ODM tool in SQL Developer. This is not the case when using the ODM PL/SQL API. So before we can run the Attribute Importance function we need to turn on ADP.
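Step 1 – Create the settings table
CREATE TABLE Att_Import_Mode_Settings (
  setting_name  VARCHAR2(30),
  setting_value VARCHAR2(4000));  -- name/value pairs read by CREATE_MODEL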
Step 2 – Turn on Automatic Data Preparation
INSERT INTO Att_Import_Mode_Settings (setting_name, setting_value)
VALUES (DBMS_DATA_MINING.PREP_AUTO, DBMS_DATA_MINING.PREP_AUTO_ON);
Step 3 – Run Attribute Importance
BEGIN
  DBMS_DATA_MINING.CREATE_MODEL(
    model_name          => 'Attribute_Importance_Test',
    mining_function     => DBMS_DATA_MINING.ATTRIBUTE_IMPORTANCE,
    data_table_name     => 'mining_data_build_v',
    case_id_column_name => 'cust_id',
    target_column_name  => 'affinity_card',
    settings_table_name => 'Att_Import_Mode_Settings');
END;
/
Step 4 – Select Attribute Importance results
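SELECT attribute_name, importance_value, rank
FROM TABLE(DBMS_DATA_MINING.GET_MODEL_DETAILS_AI('Attribute_Importance_Test'))
ORDER BY rank;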
ATTRIBUTE_NAME       IMPORTANCE_VALUE       RANK
-------------------- ---------------- ----------
HOUSEHOLD_SIZE             .158945397          1
CUST_MARITAL_STATUS        .158165841          2
YRS_RESIDENCE              .094052102          3
EDUCATION                  .086260794          4
AGE                        .084903512          5
OCCUPATION                 .075209339          6
Y_BOX_GAMES                .063039952          7
HOME_THEATER_PACKAGE       .056458722          8
CUST_GENDER                .035264741          9
BOOKKEEPING_APPLICAT       .019204751         10
CUST_INCOME_LEVEL                   0         11
BULK_PACK_DISKETTES                 0         11
OS_DOC_SET_KANJI                    0         11
PRINTER_SUPPLIES                    0         11
COUNTRY_NAME                        0         11
FLAT_PANEL_MONITOR                  0         11
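One way to use a ranking like this is to keep only the attributes with a positive importance value as inputs to a later model. The following is a minimal sketch in Python, using the values from the output above. The names `results` and `selected` are made up for this illustration and are not part of the ODM API, and the cut-off of "greater than zero" is just one possible choice.

```python
# Attribute importance results copied from the query output above:
# (attribute_name, importance_value, rank)
results = [
    ("HOUSEHOLD_SIZE", 0.158945397, 1),
    ("CUST_MARITAL_STATUS", 0.158165841, 2),
    ("YRS_RESIDENCE", 0.094052102, 3),
    ("EDUCATION", 0.086260794, 4),
    ("AGE", 0.084903512, 5),
    ("OCCUPATION", 0.075209339, 6),
    ("Y_BOX_GAMES", 0.063039952, 7),
    ("HOME_THEATER_PACKAGE", 0.056458722, 8),
    ("CUST_GENDER", 0.035264741, 9),
    ("BOOKKEEPING_APPLICAT", 0.019204751, 10),
    ("CUST_INCOME_LEVEL", 0.0, 11),
    ("BULK_PACK_DISKETTES", 0.0, 11),
    ("OS_DOC_SET_KANJI", 0.0, 11),
    ("PRINTER_SUPPLIES", 0.0, 11),
    ("COUNTRY_NAME", 0.0, 11),
    ("FLAT_PANEL_MONITOR", 0.0, 11),
]

# Keep attributes that contribute something towards predicting the target
# (assumed threshold: importance strictly greater than zero).
selected = [name for name, importance, _rank in results if importance > 0]

for name in selected:
    print(name)
```

With this data the six attributes tied at rank 11 are dropped and the ten ranked attributes remain.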
In my previous blog post (Exalytics : How much will it cost me ?) I gave an outline of the pricing you might expect for an Exalytics machine.
The final pricing that I gave, of approximately $3M+, was based on per processor licensing.
Yesterday (24th Jan) the Oracle Business Intelligence blog, in a post by Manan, included the pricing based on per user licences.
The following is a breakdown of the Exalytics pricing based on the minimum of 100 user licences.
Licence Costs (100 users)
Exalytics machine = $135,000
TimesTen = $300 x 100 users = $30,000
BI Foundation Suite = $3,675 x 100 users = $367,500
Giving a grand total of $532,500.
Support Costs (100 users)
But we need to add the annual support costs to this.
Exalytics machine support = $29,700.
TimesTen support = $66 x 100 users = $6,600
BI Foundation Suite support = $809 x 100 users = $80,900
Total support costs (100 users) = $117,200
First year & ongoing costs
Total first year cost for an Exalytics machine = $532,500 + $117,200 = $649,700
Plus ongoing annual support costs of $117,200 in year 2 and subsequent years.
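The per user arithmetic above can be checked with a few lines of Python. This is only a sketch that uses the list prices quoted in this post; the variable names are made up for the illustration.

```python
# Per named user Exalytics pricing, using the list prices quoted above.
USERS = 100  # minimum number of named users

# One-off licence costs
hardware = 135_000
timesten_licence = 300 * USERS
bi_licence = 3_675 * USERS
licence_total = hardware + timesten_licence + bi_licence

# Annual support costs
hardware_support = 29_700
timesten_support = 66 * USERS
bi_support = 809 * USERS
support_total = hardware_support + timesten_support + bi_support

# Year 1 = licences plus the first year of support
first_year = licence_total + support_total

print(f"Licence total:  ${licence_total:,}")
print(f"Annual support: ${support_total:,}")
print(f"First year:     ${first_year:,}")
```

Changing `USERS` gives a quick estimate for larger user counts, assuming the per user prices stay the same.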
If you are one of the lucky customers who can negotiate a discount, and I use the same discounts as I did in my previous blog post (25% on hardware and 60% on software), we get:
Year 1 cost of : ($135,000*0.75) + ($397,500*0.40) = $260,250
So it might be possible to get an Exalytics machine for $260+K, plus annual support costs.
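As a quick check of the discounted figure, here is a small Python sketch. The 25% and 60% discounts are the illustrative values used in this post, not published Oracle figures, and the variable names are made up for the example.

```python
# Discounted year 1 licence cost for 100 named users (support not included).
hardware = 135_000
software = 30_000 + 367_500  # TimesTen + BI Foundation Suite licences

HARDWARE_DISCOUNT_PCT = 25  # assumed discount on hardware
SOFTWARE_DISCOUNT_PCT = 60  # assumed discount on software

# Integer percentage arithmetic avoids floating point rounding.
discounted_hardware = hardware * (100 - HARDWARE_DISCOUNT_PCT) // 100
discounted_software = software * (100 - SOFTWARE_DISCOUNT_PCT) // 100
discounted_total = discounted_hardware + discounted_software

print(f"Hardware: ${discounted_hardware:,}")
print(f"Software: ${discounted_software:,}")
print(f"Total:    ${discounted_total:,}")
```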
Over the past couple of weeks the costing for the Oracle Exalytics machine has been made public by Oracle and there have been a number of articles about it. What I’ve done in this blog post is to collate this information. I give what I understand to be the cost of purchasing an Exalytics machine and getting it set up and running.
The pricing structure starts at
Exalytics machine + cost of BI Foundation Suite + TimesTen licences
Exalytics machine = $135,000
TimesTen = $34,500 per processor licence or $300 per named user (min 100 users)
BI Foundation Suite = $450,000 per processor licence or $3,675 per named user (same number of users as for TimesTen = min 100 users)
Annual Support Costs
Exalytics machine = $29,700
TimesTen = 22% of software licence – $7,590 per processor licence or $66 per named user (min 100 users)
BI Foundation Suite = $99,000 per processor licence or $809 per named user (min 100 users)
The Exalytics machine consists of a single server with 1TB of RAM and 4 Intel Xeon E7-4800 processors, with 10 cores each.
So the total cost of an Exalytics machine based on the processor licence will be close to $10M. Now this is before the discounts that you can negotiate. There are reports of discounts ranging up to 25% on hardware and 60% on software. The size of the discount depends on factors such as the size of your organisation. So this initial $10M cost could be reduced to $3M+.
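The rough $10M figure can be reproduced with the sketch below. One important assumption is not stated in this post: I am assuming Oracle's processor core factor of 0.5 for these Intel Xeon processors, so the 40 cores count as 20 processor licences. If a different core factor applies, the totals change accordingly. Support costs are left out, and the variable names are made up for the illustration.

```python
# Processor-based Exalytics licence estimate, using the list prices above.
SOCKETS = 4
CORES_PER_SOCKET = 10
cores = SOCKETS * CORES_PER_SOCKET

# ASSUMPTION: core factor of 0.5 for Intel Xeon, i.e. one licence per 2 cores.
processor_licences = cores // 2

hardware = 135_000
timesten_per_processor = 34_500
bi_per_processor = 450_000

software = processor_licences * (timesten_per_processor + bi_per_processor)
list_total = hardware + software

# Discounts reported in the post: up to 25% on hardware, 60% on software.
discounted_total = hardware * 75 // 100 + software * 40 // 100

print(f"Processor licences: {processor_licences}")
print(f"List price total:   ${list_total:,}")
print(f"After discounts:    ${discounted_total:,}")
```

Under these assumptions the list price comes out a little under $10M, and the discounted price a little under $4M, which is in line with the figures quoted above.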
Please note that I may have gotten some or all of this pricing wrong. If I have then forgive me and let me know what is wrong. I can correct it to ensure that we have the correct costs.