Professional Documents
Culture Documents
To complete the IBM Cloud Private Experiences, you must have one of the following web browsers:
• Firefox Version 54 or later
• Chrome Version 59.03 or later
To get started:
• Go to https://www.ibm.com/cloud/garage/cloud-private-experiences/icpd and sign up or log in.
• If you haven’t already, click Get my magic environment.
• Click Let’s Go on the path that you want to take.
• Locate your welcome email, which contains your user name and password.
• Log in to the web client with the credentials from the email.
1
The story
You work for a large multi-national bank and are responsible for processing mortgage applications.
Approving a mortgage application is manual and resource-intensive process. The process involves
looking at a number of variables such as a client’s income, credit card debt, the sale price of the house,
the loan amount, and so on, to determine whether the loan is a safe bet or a big gamble.
You are responsible for automating the process by building a machine learning model that can
accurately predict the likelihood of any given client defaulting on their mortgage.
The model will be invoked in real time by a web or mobile app available that customers use to apply
for a mortgage. The model enables your company to analyze a mortgage application and decide
whether to approve it in a few seconds with no manual intervention. This leads to significant cost
savings and, more importantly, improved customer service because applications will be approved or
rejected instantaneously.
The journeys
Collect
Connect to your existing databases so that you have quick and easy access to your data – no matter
where it lives.
Organize
Turn your data into trusted data. Use automated discovery to import, analyze, and classify data. Define
business terms so that your data is consistent. Apply rules and policies to make your data compliant
with regulations. Transform the data to make it useful. But most importantly, make the data easy for
users to find by publishing it to the enterprise data catalog.
Analyze
Build, deploy, and publish a machine learning model to predict whether clients will repay their
mortgage or default on it.
Future journeys with show you how to build, deploy and manage a modern web or mobile application
that can ingest online mortgage applications and generate decisions in real time by invoking the
machine learning model.
2
Collect
Learn to:
Collect your data in a single location
5 – 7 minutes
What will I learn?
In this stage of the journey, you’ll use integrated connectors to connect to
one of the bank's data sources. After you are connected, you can easily
discover data and select the data that you need to analyze.
Typically, you would use data that is already in your enterprise catalog so that you can be sure it
complies with all of the bank's regulations around privacy and security.
However, in this journey, we'll show how easy it is to add data from a remote data source to an
analytics project.
3
3. Select Create a new analytics project.
5. Click Create.
4
7. Click add data source.
8. Enter the following information into the appropriate fields to connect to the bank's IBM Db2
Warehouse database:
• Data source name:
data-analysis-connection__<myUserName>.
• Data source type:
Db2
• JDBC URL:
jdbc:db2://icpd-demo-ibm-db2oltp-
dev.default.svc.cluster.local:50000/MORTGAGE
• Username:
db2inst1
• Password:
password
5
9. Click Add remote data set so that you can specify the data that you want to analyze.
10. Enter the following information into the appropriate fields to add data to your project:
• Remote data set name:
MORTGAGE_JOIN
• Schema:
db2inst1
• Table:
MORTGAGE_JOIN
6
2. Find data in the catalog
The enterprise data catalog enables users to find relevant and trusted data, no matter where it lives.
Your data governance team should connect to all of your data sources and publish the assets to the
catalog so that the data can be used for analysis.
7
3. Ensure that the following tables are listed:
• MORTGAGE_CUSTOMER
• MORTGAGE_DEFAULT
• MORTGAGE_PROPERTY
8
Organize
Learn to:
Organize your data so it’s easy to find
5 – 7 minutes
What will I learn?
In this stage of the journey, you’ll clean, merge, and transform the data to
prepare it for machine learning. You’ll also set up business policies and rules
on the data so that it complies with any relevant regulations. After this
process is complete, you’ll have data you know you can trust.
1. Search for data in the catalog
Ideally, the data that you want is already in the enterprise data catalog.
To get started:
9
3. The results should include the following tables:
• MORTGAGE_CUSTOMER
• MORTGAGE_PROPERTY
• MORTGAGE_DEFAULT
10
2. Set policies and rules on your data
The best data is data that you can trust. You can be confident in your data when you know where it
comes from and that it conforms to a set of policies and rules that address issues like information
security, information privacy, or regulatory compliance.
Because you're dealing with customer information, you need to ensure that the data remains private
and that only specific users have access to it.
You can see which policies and rules are applied to an asset from the relationship graph or the Details
page.
To get started:
2. Click Details.
11
3. From the Details page, you can see that we’ve already applied several rules for you to ensure
that:
• The data is deleted when it’s no longer needed
• The data is not shared without appropriate approvals
12
3. Prepare the data for analysis
Before you can use data to build a machine learning model, you need to transform and cleanse it to
ensure you get the most accurate results.
In this example, we are transforming the data by joining three tables into a single table.
To get started:
You can optionally open the transformation workflow to see that it joins the three tables from
the catalog into a single table called MORTGAGE_JOIN.
13
3. Let's take a look at what the transformed data looks like. Click the Search icon to open the
search bar.
5. Click Preview.
14
7. Click Continue.
15
Analyze
Learn to:
Analyze data and build models
5 – 7 minutes
What will I learn?
In this stage of the journey, you’ll use the data that has been discovered and transformed to build a
prediction model. You’ll use notebook to quickly build and train the model.
What data do you want to use? [Mortgage]
Ensure that you downloaded the prerequisite files in the Analyze and Build module on
https://www.ibm.com/cloud/garage/cloud-private-experiences/icpd/
16
1. Create an analytics project
To get started:
17
4. Click OK.
5. Click Create. (It might take a few minutes to create your project.)
18
2. Add data to your project
Remember the prerequisite files you downloaded? You’re going to add the CSV files to your project so
that you can use the data in your analysis.
To get started:
1. Click Assets.
19
4. Click Select from your local file system.
20
3. Add a notebook to your project
Now that you have the data you need, you can start building a machine learning model.
To get started:
21
3. Click browse and upload the notebook (MortgageDefault-
BuildSave.jupyter.ipynb) from your workstation.
4. Click Create. (It might take a few seconds to create and launch your notebook.)
22
4. Build and train your model
This notebook creates a machine learning model to predict whether customers will default on a
mortgage. Feel free to explore the notebook before you proceed to get a better understanding of the
code.
To get started:
2. To confirm that the model was built and saved, click your project name.
23
5. Test your model
To get started:
You can adjust the values to see how different input affects the results.
24
You tested the model
25