Discovering your data: analyzing by connection The objective of this exercise is to set the connection properties and obtain an overview of the database and the data to be analyzed

This tutorial explains how to analyze your connection and collecting statistics and information about it.

Prerequisite:
The MySQL databases ("cif" and "crm") used in the examples must be configured correctly. The source files for these databases are available for download.

Download it!

You want to practice?

Download exampleFile.zip to get the files used for this tutorial.

You can also download tutorialProject.zip containing all the jobs needed to carry out this tutorial.

You can also:
Send it!

Share it!
Next Step: Discovering your data: analyzing by catalog

 


Creating a connection


In the DQ Repoditory view:

Expand the Metadata node and right-click DB Connection.

Select New connection in the menu.

Next
In the Database Connection wizard.

Enter your connection information.

Click Next.

Next


Name is an obligatory field. Spaces and special characters are not allowed.

Enter your database information (Login, password, hostname, port, etc.). Don't forget to verify that the information is correct by clicking Check.

Click Finish.

Next
The cif and cim databases can now be accessed via the repository.

Next
Creating a connection analysis


In the DQ Repository view:

Expand the Data Profiling node and right-click Analysis.

Select New Analysis in the menu.

Next
In the new window, select the Database Structure Overview category.

Click Next.

Next


The panel on the right describes the different analysis types and helps you to choose the one which best suits your needs.

Indicate your connection information. The Name field is obligatory and neither spaces nor special characters are allowed.

Click Next.

Next
Select the database that you want to analyze. In this example, the connection is called StagingDB.

Click Next.

Next
The new menu allows you to filter the tables and views that you want to analyze. You can insert a customized list. Each table name must be separated by a comma (eg.: table1,table2,table3).

Next
You can now run your analysis by clicking on the run icon.

Next
Viewing your analysis results


You can now access the results of your analysis.

Analysis Parameters: in this zone you can add filters on the tables or views.

Analysis summary: information concerning your last analysis.

Statistical Information: your analysis results. The rows higlighted in red indicate that no record is available.

Next
You can view all of your table keys by doing the following:

- In the table list, click right on the #keys column
- Select View keys

Next
By clicking the table names in the database view you can now access all of the information concerning the tables in your databases.

  Next Step: Discovering your data: analyzing by catalog

 

    Download it!     Send it!     Share it!

You want to practice?

Download exampleFile.zip to get the files used for this tutorial.

You can also download tutorialProject.zip containing all the jobs needed to carry out this tutorial.

Friends / colleagues may be interested in this tutorial? Send it to them!

You liked this tutorial ? Support it!

[ top ]