You are on page 1of 25

IBM Cloud Private Experiences

Before you begin

Check out the Welcome Video: http://ibm.biz/icpe-welcome

To complete the IBM Cloud Private Experiences, you must have one of the following web browsers:
• Firefox Version 54 or later
• Chrome Version 59.03 or later

To get started:
• Go to https://www.ibm.com/cloud/garage/cloud-private-experiences/icpd and sign up or log in.
• If you haven’t already, click Get my magic environment.
• Click Let’s Go on the path that you want to take.
• Locate your welcome email, which contains your user name and password.
• Log in to the web client with the credentials from the email.

1
The story
You work for a large multi-national bank and are responsible for processing mortgage applications.

Approving a mortgage application is manual and resource-intensive process. The process involves
looking at a number of variables such as a client’s income, credit card debt, the sale price of the house,
the loan amount, and so on, to determine whether the loan is a safe bet or a big gamble.

You are responsible for automating the process by building a machine learning model that can
accurately predict the likelihood of any given client defaulting on their mortgage.

The model will be invoked in real time by a web or mobile app available that customers use to apply
for a mortgage. The model enables your company to analyze a mortgage application and decide
whether to approve it in a few seconds with no manual intervention. This leads to significant cost
savings and, more importantly, improved customer service because applications will be approved or
rejected instantaneously.

The journeys

Collect
Connect to your existing databases so that you have quick and easy access to your data – no matter
where it lives.

Organize
Turn your data into trusted data. Use automated discovery to import, analyze, and classify data. Define
business terms so that your data is consistent. Apply rules and policies to make your data compliant
with regulations. Transform the data to make it useful. But most importantly, make the data easy for
users to find by publishing it to the enterprise data catalog.

Analyze
Build, deploy, and publish a machine learning model to predict whether clients will repay their
mortgage or default on it.

Future journeys with show you how to build, deploy and manage a modern web or mobile application
that can ingest online mortgage applications and generate decisions in real time by invoking the
machine learning model.

2
Collect

Learn to:
Collect your data in a single location
5 – 7 minutes
What will I learn?
In this stage of the journey, you’ll use integrated connectors to connect to
one of the bank's data sources. After you are connected, you can easily
discover data and select the data that you need to analyze.

1. Create a project for your analysis

Typically, you would use data that is already in your enterprise catalog so that you can be sure it
complies with all of the bank's regulations around privacy and security.

However, in this journey, we'll show how easy it is to add data from a remote data source to an
analytics project.

To get started, establish a connection to one of the bank's data sources:

1. From the menu, click Collect > My Data

2. Click add data set.

3
3. Select Create a new analytics project.

4. Name the project data-analysis_<myUserName>.

5. Click Create.

6. Click Remote Data Set.

4
7. Click add data source.

8. Enter the following information into the appropriate fields to connect to the bank's IBM Db2
Warehouse database:
• Data source name:
data-analysis-connection__<myUserName>.
• Data source type:
Db2
• JDBC URL:
jdbc:db2://icpd-demo-ibm-db2oltp-
dev.default.svc.cluster.local:50000/MORTGAGE
• Username:
db2inst1
• Password:
password

5
9. Click Add remote data set so that you can specify the data that you want to analyze.

10. Enter the following information into the appropriate fields to add data to your project:
• Remote data set name:
MORTGAGE_JOIN
• Schema:
db2inst1
• Table:
MORTGAGE_JOIN

11. Click Create.

You created a project and added data to it

6
2. Find data in the catalog

The enterprise data catalog enables users to find relevant and trusted data, no matter where it lives.
Your data governance team should connect to all of your data sources and publish the assets to the
catalog so that the data can be used for analysis.

To verify that the customer data is in the catalog:

1. From the menu, click Organize > Data catalog.

2. Click Database Tables.

7
3. Ensure that the following tables are listed:
• MORTGAGE_CUSTOMER
• MORTGAGE_DEFAULT
• MORTGAGE_PROPERTY

You found data in the catalog

You have all of your data at your fingertips


By completing this journey, you:
• Connected to your data source
• Created a single point of entry for your data
• Identified relevant data in the enterprise data catalog

8
Organize
Learn to:
Organize your data so it’s easy to find
5 – 7 minutes
What will I learn?
In this stage of the journey, you’ll clean, merge, and transform the data to
prepare it for machine learning. You’ll also set up business policies and rules
on the data so that it complies with any relevant regulations. After this
process is complete, you’ll have data you know you can trust.




1. Search for data in the catalog

Ideally, the data that you want is already in the enterprise data catalog.

To get started:

1. Click the Search icon to open the search bar.

2. Type Mortgage in the search bar and hit Enter.

9
3. The results should include the following tables:
• MORTGAGE_CUSTOMER
• MORTGAGE_PROPERTY
• MORTGAGE_DEFAULT

You found data in the catalog

10
2. Set policies and rules on your data

The best data is data that you can trust. You can be confident in your data when you know where it
comes from and that it conforms to a set of policies and rules that address issues like information
security, information privacy, or regulatory compliance.

Because you're dealing with customer information, you need to ensure that the data remains private
and that only specific users have access to it.

You can see which policies and rules are applied to an asset from the relationship graph or the Details
page.

To get started:

1. From the search results, select MORTGAGE_CUSTOMER.

2. Click Details.

11
3. From the Details page, you can see that we’ve already applied several rules for you to ensure
that:
• The data is deleted when it’s no longer needed
• The data is not shared without appropriate approvals

You set policies and rules on your data

12
3. Prepare the data for analysis

Before you can use data to build a machine learning model, you need to transform and cleanse it to
ensure you get the most accurate results.

Cleansing removes duplicate, incorrect, incomplete, or improperly formatted data.

In this example, we are transforming the data by joining three tables into a single table.

To get started:

1. Click Organize > Transform data.

2. We created a data transformation workflow for you. The workflow is called


Mortgage_datatransform.

You can optionally open the transformation workflow to see that it joins the three tables from
the catalog into a single table called MORTGAGE_JOIN.

13
3. Let's take a look at what the transformed data looks like. Click the Search icon to open the
search bar.

4. Type MORTGAGE_JOIN in the search bar and hit Enter.

5. Click Preview.

6. Authenticate to the database with the following credentials:


• User name:
db2inst1
• Password:
password

14
7. Click Continue.

8. When you’re done exploring the data, click Close.

You prepared the data for analysis

You organized your data so it’s easy to find


By completing this journey, you:
• Ensured compliance with organizational and regulatory requirements
• Transformed your data to make it useful

15
Analyze
Learn to:
Analyze data and build models
5 – 7 minutes
What will I learn?
In this stage of the journey, you’ll use the data that has been discovered and transformed to build a
prediction model. You’ll use notebook to quickly build and train the model.
What data do you want to use? [Mortgage]

Before you begin

Ensure that you downloaded the prerequisite files in the Analyze and Build module on
https://www.ibm.com/cloud/garage/cloud-private-experiences/icpd/

Click Let’s go to start the journey.

16
1. Create an analytics project

A project is a collection of assets that you need to perform your analysis.

To get started:

1. From the menu, click Projects.

2. Click New Project.

3. Name your project data-analysis_<myUserName>2.

17
4. Click OK.

5. Click Create. (It might take a few minutes to create your project.)

You created an analytics project

18
2. Add data to your project

Remember the prerequisite files you downloaded? You’re going to add the CSV files to your project so
that you can use the data in your analysis.

To get started:

1. Click Assets.

2. Click Data Sets.

3. Click add data set.

19
4. Click Select from your local file system.

5. Upload the following files from your workstation:


• Mortgage_Customer.csv
• Mortgage_Default.csv
• Mortgage_Property.csv

You added data to your project

20
3. Add a notebook to your project

Now that you have the data you need, you can start building a machine learning model.

To get started:

1. Click the + icon and select Create notebook.

2. Select From File.

21
3. Click browse and upload the notebook (MortgageDefault-
BuildSave.jupyter.ipynb) from your workstation.

4. Click Create. (It might take a few seconds to create and launch your notebook.)

You added a notebook to your project

22
4. Build and train your model

This notebook creates a machine learning model to predict whether customers will default on a
mortgage. Feel free to explore the notebook before you proceed to get a better understanding of the
code.

To get started:

1. Click Cell > Run All to train the model.

2. To confirm that the model was built and saved, click your project name.

3. Confirm that the Predict Mortgage Default LOS model is displayed.

You trained and saved the model

23
5. Test your model

Your model is ready to test.

To get started:

1. Open the action menu and click Test.

2. Scroll to the Input section and click Submit.

You can adjust the values to see how different input affects the results.

24
You tested the model

You analyzed data and built a powerful model


By completing this journey, you:
• Created an analytics project.
• Ran a notebook to generate a model and saved it to the model repository.
• Tested the model.

25

You might also like