DataQAdm 8117
DataQAdm 8117
Copyright 2005, 2012 Oracle and/or its affiliates. All rights reserved. This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited. The information contained herein is subject to change without notice and is not warranted to be errorfree. If you find any errors, please report them to us in writing. If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable: U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered to U.S. Government end users are commercial computer software pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government. This software or hardware is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous applications, including applications that may create a risk of personal injury. If you use this software or hardware in dangerous applications, then you shall be responsible to take all appropriate fail-safe, backup, redundancy, and other measures to ensure its safe use. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this software or hardware in dangerous applications. Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark of The Open Group. This software or hardware and documentation may provide access to or information on content, products, and services from third parties. Oracle Corporation and its affiliates are not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content, products, or services.
Contents
Data Quality Guide for Oracle Customer Hub 1
Chapter 1: Chapter 2:
Data Profiling
21
22
Chapter 3:
Data Cleansing Data Matching
Match Key Generation with the Oracle Data Quality Matching Server 28 Match Key Generation Using the Universal Connector with Third-Party Data Quality Vendors 28
Identification of Candidate Records Calculation of Match Scores Displaying Duplicates Fuzzy Query 32 31 30
29
Chapter 4:
Levels of Enabling and Disabling Data Cleansing and Data Matching Enabling Data Quality at the Enterprise Level Specifying Data Quality Settings 37
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Contents
39
41
Enabling Data Quality Using the GUI 40 Enabling Data Quality Using the Command-Line Interface
Enabling Data Quality at the User Level Enabling and Disabling Fuzzy Query 43
42 43 45
Disabling Data Cleansing for Specific Records Identifying Mandatory Fields for Fuzzy Query
Chapter 5:
Registering New Data Quality Connectors 51 Configuring Business Components and Applets for Data Matching and Data Cleansing
53 53
Mapping Data Matching Vendor Fields to Siebel Business Components 54 Adding a Field Mapping for Data Matching 54 Mapping Data Cleansing Vendor Fields to Siebel Business Component Fields
55
56
57
Configuring the Mandatory Fields for Fuzzy Query Data Quality User Properties 58
58
Account - Business Component User Properties 59 Contact - Business Component User Properties 60 List Mgmt Prospective Contact - Business Component User Properties DeDuplication - Business Service User Properties 63 Data Cleansing - Business Service User Properties 63 DQ Sync Services - Business Service User Properties 63 ISS System Services - Business Services User Properties 66
62
67 68
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Contents
Chapter 6:
Process of Configuring Oracle Enterprise Data Quality for Matching and Address Verification 70
Configuring Configuring Configuring Configuring the Siebel Server Under Windows 70 the Siebel Server Under UNIX 71 the Siebel Server to Use Oracle Enterprise Data Quality a New Data Quality Vendor 73
72
Process of Configuring the Staging Database for Oracle Enterprise Data Quality
Creating Tables 75 Configuring Connections 75
75
76 77
Oracle Enterprise Data Quality Field Mappings for Siebel Business Components
85
Enabling Support of Multiple Child Entity Deduplication for Oracle Enterprise Data Quality 85 Data Format for Deduplication of Multiple Child Entities 85
Chapter 7:
Setting Up the Environment and the Database 90 Installing Oracle Data Quality Matching Server 92 Creating Database Users and Tables for Oracle Data Quality Matching Server 97 Configuring Oracle Data Quality Matching Server 100 Modifying Configuration Parameters for Oracle Data Quality Matching Server 104 Deploying Workflows for Oracle Data Quality Matching Server Integration 105
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN) 5
Contents
Initial Loading of Siebel Data into Oracle Data Quality Matching Server Tables
106
108 110
110
Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Matching Server 111
Preconfigured Vendor Parameters for Oracle Data Quality Matching Server 112 Preconfigured Field Mappings for Oracle Data Quality Matching Server 112
Configuring a New Field for Real-Time Data Matching Incremental Data Load 118
115
Process of Configuring Data Synchronization Between Siebel and Oracle Data Quality Matching Server 120
Configuring the Data Source 120 Configuring Object Managers for ISS Data Source 121
122
DQ Sync Action Sets for Account 122 DQ Sync Action Sets for Contact 127 DQ Sync Action Sets for List Mgmt Prospective Contact Generic DQ Sync Action Sets 137
133
Configuring Deduplication Against Multiple Addresses Configuring Multiple Mode Support for Data Matching
ssadq_cfg.xml 146
Installing Oracle Data Quality Address Validation Server 150 Modifying Configuration Parameters for Oracle Data Quality Address Validation Server 151 Acquiring the License Key and Postal Directories for Oracle Data Quality Address Validation Server 153
Configuring Siebel Business Applications for the Oracle Data Quality Address Validation Server 155 Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Address Validation Server 156
Preconfigured Vendor Parameters for Oracle Data Quality Address Validation Server 157 Preconfigured Field Mappings for Oracle Data Quality Address Validation Server 157
Sample Configuration File for Oracle Data Quality Address Validation Server
159
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Contents
ssadq_cfgasm.xml
160
161
ISSSYNC Action Sets for Account 162 ISSSYNC Action Sets for Contact 169 ISSSYNC Action Sets for List Mgmt Prospective Contact Generic ISSSYNC Action Sets 183
175
Activating Action Sets in Your Siebel Business Application Sample SQL Scripts 184
IDS_IDT_ACCOUNT_STG.SQL 185 IDS_IDT_CONTACT_STG.SQL 186 IDS_IDT_PROSPECT_STG.SQL 187 IDS_IDT_CURRENT_BATCH.SQL 188 IDS_IDT_CURRENT_BATCH_ACCOUNT.SQL 189 IDS_IDT_CURRENT_BATCH_CONTACT.SQL 189 IDS_IDT_CURRENT_BATCH_PROSPECT.SQL 190 IDS_IDT_LOAD_ANY_ENTITY.CMD 190 IDS_IDT_LOAD_ANY_ENTITY.sh 192
184
197
Chapter 8:
Real-Time Data Cleansing and Data Matching 206 Batch Data Cleansing and Data Matching 207
Data Quality Batch Job Parameters Cleansing Data Using Batch Jobs Matching Data Using Batch Jobs
Example of Batch Data Matching Using the Universal Connector 213 Generating or Refreshing Keys Using Batch Jobs 215 Customizing Data Quality Server Component Jobs for Batch Mode 216
217
219
220
220
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Contents
222
222 223
Using Fuzzy Query for Accounts Using Fuzzy Query for Contacts
224
225
Scenario for Data Matching Using the Value Match Method 224 Scenario for Data Cleansing Using Data Cleansing Business Service Methods Deduplication Business Service Methods 225 Data Cleansing Business Service Methods 230
Connector Initialization and Termination Functions Session Initialization and Termination Functions Parameter Setting Functions Error Message Function 238 239
241
235 237
242
242
250
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Contents
259
Index
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Contents
10
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Whats New in Data Quality Guide for Oracle Customer Hub, Version 8.2 (8.1.1.7 Quick Fix 07AN)
Table 1 lists changes described in this version of the documentation to support release 8.2 of the software.
Table 1. Topic
Changes in Data Quality Guide for Oracle Customer Hub, Version 8.2 (8.1.1.7 Quick Fix 07AN) Description New topic. It details the process of configuring Oracle Enterprise Data Quality for matching and address validation in Siebel CRM. New topic. It details the process of configuring a staging database for Oracle Enterprise Data Quality. New topic. It includes all the parameters required to enable and manage third-party vendors with Oracle Enterprise Data Quality. New topic. It provides all field mappings for the Siebel business components working with Oracle Enterprise Data Quality. New topic. It provides an overview of configuring third-party key generation for Oracle Enterprise Data Quality. New topic. It provides an overview of the enabling support for multiple child entity deduplication for Oracle Enterprise Data Quality. New topic. It provides the data format for the deduplication of multiple child entities. Modified topic. Added information about the MultipleChildEntitiesDedup input argument for the Value Match method. New topic. It details the external key generation function which allows you to use third party vendors to generate deduplication keys in Siebel CRM. Modified topic. Added new information about the New Key Generation algorithm.
Process of Configuring Oracle Enterprise Data Quality for Matching and Address Verification on page 70 Process of Configuring the Staging Database for Oracle Enterprise Data Quality on page 75 Vendor Parameters for Oracle Enterprise Data Quality on page 76 Oracle Enterprise Data Quality Field Mappings for Siebel Business Components on page 77 Configuring Third-Party Key Generation for Oracle Enterprise Data Quality on page 85 Enabling Support of Multiple Child Entity Deduplication for Oracle Enterprise Data Quality on page 85 Data Format for Deduplication of Multiple Child Entities on page 85 Calling Data Matching and Data Cleansing from Scripts or Workflows on page 224 External Key Generation Function on page 237 Data Matching and Data Cleansing Algorithms on page 248
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
11
Whats New in Data Quality Guide for Oracle Customer Hub, Version 8.2 (Fix Pack 8.1.1.4)
Table 2 lists changes described in this version of the documentation to support release 8.2 of the software.
Table 2. Topic
Changes in Data Quality Guide for Oracle Customer Hub, Version 8.2 (Fix Pack 8.1.1.4) Description New topic. Describes how to install Oracle Data Quality Matching Server on UNIX. New topic. Describes how to configure Oracle Data Quality Matching Server on UNIX. New topic. Describes how to install Oracle Data Quality Address Validation Server for data cleansing. Oracle Data Quality Address Validation Server uses a licensed version of the third party software, Informatica Identity Resolution from Informatica, for data cleansing. Oracle Data Quality Address Validation Server supports Informatica Address Doctor Version 5 postal directories. Modified topic. Provides information about data quality user properties, including deduplication and data cleansing business service user properties. The following deduplication and data cleansing user properties are no longer required: Deduplication user properties
Installing Oracle Data Quality Matching Server on page 92 Configuring Oracle Data Quality Matching Server on UNIX on page 102 Process of Installing the Oracle Data Quality Address Validation Server on page 149
Configuring a New Field for RealTime Data Matching on page 115 Incremental Data Load on page 118 Process of Configuring Data Synchronization Between Siebel and Oracle Data Quality Matching Server on page 120 Configuring Deduplication Against Multiple Addresses on page 139
New topic. Describes how to configure a new field for data matching when you use Oracle Data Quality Matching Server for data matching. New topic. Describes how to perform an incremental data load. New topic. Describes how to configure the data source definition required to synchronize data between your Siebel application and Oracle Data Quality Matching Server. New topic. This enhanced functionality is available when you use Oracle Data Quality Matching Server for data matching.
12
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Table 2. Topic
Changes in Data Quality Guide for Oracle Customer Hub, Version 8.2 (Fix Pack 8.1.1.4) Description New topic. This enhanced functionality is available when you use Oracle Data Quality Matching Server for data matching. New topic. This enhanced functionality is available when you use Oracle Data Quality Matching Server for data matching. New topic. Describes how to configure Oracle Data Quality Address Validation Server for data cleansing. The following vendor parameters provide better control of the data returned by the Oracle Data Quality Address Validation Server: ASM Country Database Return Code ASM High Deliverability Return Code
Configuring Multiple Language Support for Data Matching on page 141 Configuring Multiple Mode Support for Data Matching on page 145 Configuring Siebel Business Applications for the Oracle Data Quality Address Validation Server on page 155
Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Address Validation Server on page 156 Appendix A, Siebel Business Applications Action Sets
New topic. Describes the Universal Connector parameter and field mapping values for the Oracle Data Quality Address Validation Server. Modified appendix. Siebel business application action sets have been expanded to include DQ Sync action sets (ISSSYNC* is replaced with DQ Sync*). The following runtime events must be enabled for DQ Sync action sets: DQ Sync DeleteRecord Contact (Account, Prospect) DQ Sync PreDeleteRecord Contact (Account, Prospect) DQ Sync PreWriteRecord Contact (Account, Prospect) DQ Sync WriteRecord Contact (Account, Prospect) DQ Sync Update Address DQ Sync WriteRecordNew
These run-time events now point to the DQ Sync Services Business Service and facilitate the following: Appendix B, Sample Script Files Appendix F, Upgrading to Informatica Identity Resolution 9.01 Synchronization with Oracle Data Quality Matching Server. Entry into Oracle Data Quality Matching Server database tables.
New appendix. Shows example configuration files and SQL script files. New topic. Describes the changes required to integrate Oracle Data Quality Matching Server with Informatica Identity Resolution 9.01.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
13
Additional Changes
The following product names have changed: Identity Search Server has changed to Informatica Identity Resolution. Oracle Data Quality Cleansing Server has changed to Oracle Data Quality Address Validation Server.
14
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
This chapter provides an overview of data quality functionality and products for Siebel CRM and Oracle Customer Hub. It includes the following topics: Data Profiling on page 15 Data Parsing and Standardization on page 16 Data Matching and Data Cleansing on page 16 Data Quality Products for Data Matching and Data Cleansing on page 17 How Data Quality Relates to Other Entities in Siebel Business Applications on page 22
Data Profiling
Data profiling typically provides profiling capabilities that are set in an application specifically designed to give control of data quality processes to business information owners, such as data analysts and data stewards. Data profiling also provides data analysis, reporting, and monitoring capabilities. When data quality is measured, it can be effectively managed. Data profiling provides the metrics and reports that business information owners need to continuously measure, monitor, track, and improve data quality at multiple points across the organization. Data profiling also enables business information owners and IT (information technology) to work together to deploy lasting data quality programs. Business information owners use data profiling to build data quality rules and define data quality targets together with the IT team, which then manages deployment enterprise-wide. You can use data profiling to: Analyze and rank data according to completeness, conformity, consistency, duplication, integrity, and accuracy (you must use rules and reference data to analyze and rank data). Identify, categorize, and quantify low-quality data
For more information about data profiling and Oracle data profiling offerings, see Oracle Fusion Middleware Upgrade Guide for Oracle Data Integrator 11g Release 1 on Oracle Technology Network (http://www.oracle.com/technetwork/indexes/documentation/index.html).
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
15
For more information about data parsing and standardization and Oracle offerings within the data parsing and standardization arena, see Oracle Fusion Middleware Upgrade Guide for Oracle Data Integrator 11g Release 1 on Oracle Technology Network (http://www.oracle.com/technetwork/ indexes/documentation/index.html).
Data Cleansing
Data cleansing is used to correct data and make data consistent in new or modified customer records and typically consists of the following functions: Automatic population of fields in addresses. If a user enters valid values for Zip Code, City, and Country, data quality automatically supplies a State field value. Likewise, if a user enters valid values for City, State, and Country, data quality automatically supplies a Zip Code value. Address correction. Data quality stores street address, city, state, and postal code information in a uniform and consistent format, as mandated by U.S. postal requirements. For recognized U.S. addresses, address correction provides ZIP+4 data correction and stores the data in certified U.S. Postal Service format. For example, 100 South Main Street, San Mateo, CA 94401 becomes 100 S. Main St., San Mateo, CA 94401-3256. Capitalization. Based on configuration, data quality converts fields for account, contact, prospect, and address to mixed case, all lowercase, or all uppercase. Standardization. Data quality ensures account, contact, and prospect information is stored in a uniform and consistent format. For example, IBM Corporation becomes IBM Corp.
Data cleansing is supported for the Account, Business Address, Contact, and List Mgmt Prospective Contact business components. For each business component, particular fields are used in data cleansing and this set of fields is configurable.
16
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Overview of Data Quality Data Quality Products for Data Matching and Data Cleansing
Data Matching
Data matching is the identification of potential duplicates for account, contact, and prospect records. Potential duplicate records are displayed in the Siebel application allowing you to manually merge duplicate records into a single record. Data matching is supported for the Account, Contact, and List Mgmt Prospective Contact business components. For each business component, a set of fields is used for comparisons in the data matching process. The set of fields is configurable, and you can also specify other matching preferences such as the degree of matching required for records to be identified as potential duplicates. TIP: The term deduplication is often used as a synonym for data matching particularly in names of system parameters. In data quality you can enable and use both data cleansing and data matching at the same time, or you can use data cleansing and data matching on their own.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
17
Overview of Data Quality Data Quality Products for Data Matching and Data Cleansing
Related Topic Related Data Quality Objects in Siebel Tools on page 253
The Oracle Data Quality Matching Server connector uses the Universal Connector in a mode where match candidate acquisition takes place within the Oracle Data Quality Matching Server, not within Siebel CRM. Since the match keys are generated and stored within the Oracle Data Quality Matching Server, key generation and key refresh operations are eliminated within Siebel CRM. This integration,
18
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Overview of Data Quality Data Quality Products for Data Matching and Data Cleansing
whereby match candidate acquisition takes place within the Oracle Data Quality Matching Server cannot be used by other third-party data quality matching engines. For more information about Oracle Data Quality Matching Server installation and configuration, see Process of Installing the Oracle Data Quality Matching Server on page 90 and Configuring Oracle Data Quality Matching Server on page 100. For more information about Informatica Identity Resolution, see the relevant documentation included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
19
Overview of Data Quality Data Quality Products for Data Matching and Data Cleansing
For more information about Oracle Data Quality Address Validation Server installation and configuration, see Process of Installing the Oracle Data Quality Address Validation Server on page 149 and Configuring Siebel Business Applications for the Oracle Data Quality Address Validation Server on page 155. For more information about Informatica Identity Resolution, see the relevant documentation included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud.
20
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Overview of Data Quality Data Quality Products for Data Matching and Data Cleansing
Universal Connector
NOTE: In previous releases, Universal Connector was known as SDQ Universal Connector. The Universal Connector is a connector to third-party software that allows Siebel CRM to use the capabilities of a third-party application for data matching, data cleansing, or both data matching and data cleansing on account, contact, and prospect data within the Siebel application. The Universal Connector supports data cleansing on account, contact, and prospect data in real-time and batch processing modes. The Universal Connector works across various languages and operating systems, though the support offered by particular third-party software for data matching or data cleansing might not cover all of the languages supported by Siebel Business Applications. For more information about: Platforms supported, see Siebel System Requirements and Supported Platforms on Oracle Technology Network. Third-party software, see the relevant documentation included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud.
To use the Universal Connector, you must obtain, license, and install third-party software in addition to obtaining Siebel Data Quality product licensing. The data matching and data cleansing capabilities of the Universal Connector are driven by the capabilities and configuration options of the third-party software. NOTE: Certain third-party software from data quality vendors are certified by Oracle. For information about third-party solutions and about products that are certified for the Universal Connector, visit the Alliances section and the Partners section on the Oracle and Siebel Web site: http://www.oracle.com/siebel/index.html The Universal Connector can be used in two different modes: The Oracle Data Quality Matching Server connector uses the Universal Connector in a mode where match candidate acquisition takes place within the Oracle Data Quality Matching Server. This mode applies only to the Oracle Data Quality Matching Server. Third-party data quality vendors use the Universal Connector in a mode where match candidate acquisition takes place within Siebel CRM.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
21
Overview of Data Quality How Data Quality Relates to Other Entities in Siebel Business Applications
You can configure the Universal Connector to specify which fields are used for data cleansing and data matching and their mapping to external application field names. NOTE: The Oracle Data Quality License is valid only for use with Oracle Master Data Management and Oracle CRM deployments.
22
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Overview of Data Quality How Data Quality Relates to Other Entities in Siebel Business Applications
Data matching and data cleansing can also be enabled for the Enterprise Application Integration (EAI) adapter and Oracles Siebel Universal Customer Master (UCM) products. For more information about business services and enabling data quality when using EAI, see Integration Platform Technologies: Siebel Enterprise Application Integration.
Figure 1.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
23
Overview of Data Quality How Data Quality Relates to Other Entities in Siebel Business Applications
24
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
This chapter provides the conceptual information that you must use to configure data quality for Siebel CRM and Oracle Customer Hub. It includes the following topics: Data Cleansing on page 25 Data Matching on page 26 Match Key Generation on page 27 Identification of Candidate Records on page 29 Calculation of Match Scores on page 30 Displaying Duplicates on page 31 Fuzzy Query on page 32
Data Cleansing
The Universal Connector supports data cleansing on the Account, Business Address, Contact, and List Mgmt Prospective Contact business components. For Siebel Industry Applications, the CUT Address business component is used instead of the Business Address business component. NOTE: Functionality for the CUT Address business component and Personal address business component varies. For example, only unique addresses can be associated with Contacts or Accounts when using the Personal Address. In contrast, the CUT Address does not populate the S_ADDR_PER.PER_ID table column, thereby allowing non-unique records to be created according to the S_ADDR_PER_U1 unique index and associated user key. For each type of record, data cleansing is performed for the fields that are specified in the Third Party Administration view. The mapping between the Siebel application field names and the vendor field names is defined for each business component. In real-time mode, data cleansing begins when a user saves a newly created or modified record. When the record is committed to the Siebel database:
1 2 3 4
A request for cleansing is automatically submitted to the Data Cleansing business service. The Data Cleansing business service sends the request to the third-party data cleansing software, along with the applicable data. The third-party software evaluates the data and modifies it in accordance with the vendors internal instructions. The third-party software sends the modified data to the Siebel application, which updates the database with the cleansed information and displays the cleansed information to the user.
In batch mode you use batch jobs to perform data cleansing on all the records in a business component or on a specified subset of those records. For data cleansing batch jobs, the process is similar to that for real-time mode, but the batch job corrects the records without immediately
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
25
displaying the changes to users. The process starts when an administrator runs the server task, and the process continues until all the specified records are cleansed. If both data cleansing and data matching are enabled, data cleansing is done first. For information about running data cleansing batch jobs, see Cleansing Data Using Batch Jobs on page 212.
Data Matching
The Universal Connector and Matching Server supports data matching on the Account, Contact, and List Mgmt Prospective Contact business components. For each type of record, data matching is performed for the current record against all other records of the same type, and with the same match keys, in the application using the fields specified in the Third Party Administration view. The mapping between the Siebel application field names and the vendor field names is defined for each business component. Data quality performs matching using fields, for example, addresses, that can have multi-value group (MVG) values associated with the type of record being matched. However, data quality is not currently able to match using MVGs. Therefore, when performing matching for a contact, data quality checks only the primary address for each contact record and does not consider other addresses. In real-time data matching, whenever an account, contact, or prospect record is committed to the database, a request is automatically submitted to the Deduplication business service. The business service communicates with third-party data quality software, which checks for possible matches to the newly committed record and reports the results to the Siebel application. In batch mode data matching, you first start a server task to generate or refresh the keys, and then start another server task to perform data matching. For information about performing batch mode data matching, see Matching Data Using Batch Jobs on page 213. In both real-time and batch mode, whenever a primary address is updated for an account or contact record, match keys are regenerated and data matching is performed for that account or contact. The following is the overall sequence of events in data matching:
1 2
Match keys are generated for database records for which data matching is enabled. When a user enters or modifies a record in real-time mode, or the administrator submits a batch data matching job:
a b c d e 3
A request is automatically submitted to the Deduplication business service. Using match keys, candidate matches are identified for each record. This is a means of filtering the potential matching records. The Deduplication business service sends the candidate records to the third-party software. The third-party software evaluates the candidate records and calculates a match score for each candidate record to identify the duplicate records. The third-party software returns the duplicate records to the Siebel application.
The duplicate records are displayed either in a window for real-time mode, or in the Administration - Data Quality views, from which you can manually merge records into a single record.
26
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
NOTE: If using the Oracle Data Quality Matching Server for data matching, then you carry out deduplication against either the primary address or all address entities depending on configuration. For more information about deduplication against multiple addresses, see Configuring Deduplication Against Multiple Addresses on page 139.
If you generate match keys with a search specification that covers record 1, 2, and 3, new keys are generated for record 1, 2, and 3. However, if you refresh match keys with a search specification to cover record 1, 2, and 3, new keys are generated for record 2 and 3 only. The batch capability is useful in the following circumstances: If you deploy data quality in a Siebel application implementation that already contains data If you receive new data using an input method that does not involve object manager, such as EIM or batch methods such as the List Import Service Manager To periodically review data to ensure the correctness of previous matching efforts.
For instructions about using batch jobs to generate or refresh keys, see Generating or Refreshing Keys Using Batch Jobs on page 215.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
27
Additionally, if real-time data matching is enabled for users, keys are automatically generated (or refreshed) for a record whenever the user saves a new Account, Contact, or List Mgmt Prospective Contact record or modifies and commits an existing record to the database. If no keys are generated for a certain record, that record is ignored as a potential candidate record when matching takes place.
Match Key Generation with the Oracle Data Quality Matching Server
When the Universal Connector is integrated with the Oracle Data Quality Matching Server for data matching, it supports data matching on account, contact, and prospect data in real-time and batch processing modes. Whenever a record is created or updated in real-time or batch mode, match keys are generated by and stored within the Oracle Data Quality Matching Server. As a result, the information in Match Key Generation on page 27 does not apply.
Match Key Generation Using the Universal Connector with Third-Party Data Quality Vendors
When the Universal Connector is integrated with any other third-party data quality vendor software for data matching, match key generation is as described in this topic. That is, match keys are generated by and stored within Siebel CRM whenever a record is created or updated in real-time or in batch mode. The Universal Connector uses one or multiple keys for each account, contact, or prospect record. The keys are calculated by reading data from specific fields in the record. The fields used depend on the business component configuration, but they can include account name, postal code, street address, or last name fields. The value of the match keys depend on a business component-specific Dedup Token Expression parameter, as shown in Table 3 on page 30. You can customize the Dedup Token Expression but it must be consistent with the internal matching logic of the vendor, which is different for each vendor. For optimal results therefore, change the values only after consulting the relevant vendor. The generation of multiple match keys enhances the span of search for potential duplicate records, and improves match results. However, you must remember that there is a performance impact from using multiple keys. Keys are stored in the DEDUP_TOKEN fields of the following tables: S_DQ_ORG_KEY (for Accounts) S_DQ_CON_KEY (for Contacts) S_DQ_PRSP_KEY (for Prospects)
28
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
You must activate the Dedup Token field in each business component in order to generate the correct match keys. If the Dedup Token field is not defined, match key generation methods will not be called. You must add the user property for the Token Expression along with the Query Expression so that the correct match keys can be generated and stored in the DEDUP_TOKEN field. NOTE: In Siebel CRM 7.8.x, the column DEDUP_TOKEN is available in the following tables: S_CONTACT, S_ORG_EXT, S_PRSP_CONTACT.
Identification of Candidate Records with the Oracle Data Quality Matching Server
When using the Oracle Data Quality Matching Server for data matching, identification of candidate records is irrelevant as match candidate acquisition takes place within the Oracle Data Quality Matching Server.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
29
See Table 3 for examples about how the default expressions can differ for different business components.
Table 3.
Expressions Used for Keys and Queries (Example) Dedup Token Expression Parameter (Key) "IfNull (Left ([Primary Account Postal Code], 5), '_____') + IfNull (Left ([Name], 1), '_') + IfNull (Mid ([Street Address], FindNoneOf ([Street Address], '1234567890 '), 1), '_')" "IfNull (Left ([Postal Code], 5), '_____') + IfNull (Left ([Account], 1), '_') + IfNull (Left ([Last Name], 1), '_')" "IfNull (Left ([Postal Code], 5), '_____') + IfNull (Left ([Account], 1), '_') + IfNull (Left ([Last Name], 1), '_')" Dedup Query Expression Parameter (for Queries) "IfNull (Left ([Primary Account Postal Code], 5), '?????') + IfNull (Left ([Name], 1), '?') + IfNull (Mid ([Street Address], FindNoneOf ([Street Address], '1234567890 '), 1), '?')" "IfNull (Left ([Postal Code], 5), '?????') + IfNull (Left ([Account], 1), '?') + IfNull (Left ([Last Name], 1), '?')" "IfNull (Left ([Postal Code], 5), '?????') + IfNull (Left ([Account], 1), '?') + IfNull (Left ([Last Name], 1), '?')"
Contact
The maximum number of candidate records that are sent to the third-party software at one time is determined by the value of the following vendor parameters in the Third Party Administration view: Realtime Max Num of Records. Used in real time, the default value is 200, which is the highest value that you can set. Usually there will not be more than 200 records to send, but if there are more than 200 records, the first 200 records are sent. Batch Max Num of Records. Used in batch mode, the default is 200, which is the highest value that you can set. If there are more than 200 records to send, the first 200 records are sent, then up to 200 records in the next iteration, and so on.
NOTE: Information in this topic does not apply if using the Oracle Data Quality Matching Server for data matching as match candidate acquisition takes place within the Oracle Data Quality Matching Server.
30
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Calculation of Match Scores Using the Universal Connector with ThirdParty Data Quality Vendors
The third-party software examines the candidate records, computes a match score for each record that is identified as a duplicate, and returns the duplicate records to data quality. The match score is a number that represents the similarity of a record to the current active record. It is calculated taking into account a large number of rules along with a number of other factors and weightings.
Displaying Duplicates
NOTE: This applies to all data quality products. After calculating match scores, the third-party software returns duplicate records to the Siebel application. In real-time mode, the Siebel application displays the duplicate records in a window. These windows are: DeDuplication Results (Account) List Applet DeDuplication Results (Contact) List Applet DeDuplication Results (Prospect) List Applet
You can however, configure the names of these windows as described in Configuring the Windows Displayed in Real-Time Data Matching on page 56. The user can either choose a record for the current record to be merged with, or click Ignore to leave the possible duplicates unchanged. For more information, see Real-Time Data Cleansing and Data Matching on page 206. In batch mode, duplicate records are displayed in the Duplicate Account Resolution, Duplicate Contact Resolution, and Duplicate Prospect Resolution views in the Administration - Data Quality screen and also in the following views: Account Duplicates Detail View Contact Duplicates Detail View List Mgmt Prospective Contact Duplicates Detail View.
The user can then decide about which records to retain or merge with the retained records. For information about merging records, see Merging of Duplicate Records on page 219. If data cleansing is enabled for Siebel Universal Customer Master, you can use the following views of the Administration - Universal Customer Master screen to display duplicates: UCM Account Duplicates Detail View UCM Contact Duplicates Detail View
The default data quality views for accounts and contacts must be disabled. There is no separate UCM view for prospects.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
31
Fuzzy Query
Fuzzy query is an advanced query feature that makes searching more intuitive and effective. It uses fuzzy logic to enhance your ability to locate information in the database. Fuzzy query is useful in customer interaction situations for locating the correct customer information with imperfect information. For example, fuzzy query makes it possible to find matches even if the query entries are misspelled. As an example, in a query for a customer record for Stephen Night, you can enter Steven Knight and records for Stephen Night as well as similar entries like Steve Nite are returned. Standard query methods can rule out rows due to lack of exact matches, whereas fuzzy query does not rule out rows that contain only some of the query specifications. The fuzzy query feature is most useful for queries on account, contact, and prospect names, street names, and so on. Fuzzy query operates as follows:
1 2
A user enters a query from the Siebel application GUI. Data quality inspects the query for wildcard characters, such as the * (asterisk). If any wildcards are present, data quality uses standard query functionality for that query, not fuzzy query functionality. Data quality generates a Dedup Token from certain specified fields in the current query input, and uses the token to query the database for possible data matches. Data quality preserves query text in fields that the DeDuplication service does not evaluate for potential data matches. For more information about Dedup Tokens, see Identification of Candidate Records on page 29. The remainder of the process depends on the number of records that are returned in the previous step:
If the preliminary query results contain more records than the value of the Fuzzy Query Max Results setting, then data quality calls the DeDuplication business service, which works with the third-party data matching engine to evaluate the possible matches. The query result returns the best available matches, up to the number of records specified by Fuzzy Query Max Results. If the preliminary query results contain fewer records than the value of the Fuzzy Query Max Results setting, then data quality returns all of those records as the query result, sorted according to the default sort specification for the business component.
Fuzzy query is not enabled by default; to use fuzzy query you must enable it and ensure that other conditions are met as described in Enabling and Disabling Fuzzy Query on page 43.
32
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
This chapter describes how to enable data matching and data cleansing, and describes the data quality settings that you can apply for Siebel CRM and Oracle Customer Hub. Data cleansing and data matching must be enabled before you perform data quality tasks. This chapter includes the following topics: Levels of Enabling and Disabling Data Cleansing and Data Matching on page 33 Enabling Data Quality at the Enterprise Level on page 35 Specifying Data Quality Settings on page 37 Enabling Data Quality at the Object Manager Level on page 39 Enabling Data Quality at the User Level on page 42 Disabling Data Cleansing for Specific Records on page 43 Enabling and Disabling Fuzzy Query on page 43 Identifying Mandatory Fields for Fuzzy Query on page 45
Values set at the enterprise level Administration - Server Configuration, Enterprises, Data Quality Administration - Server Configuration, Enterprises, Parameters Enable button Disable button DeDuplication Data Type Data Cleansing Type Data Matching Vendor Name Data Cleansing Vendor Name Not applicable Application administrator Application administrator
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
33
Enabling and Disabling Data Matching and Data Cleansing Levels of Enabling and Disabling Data Cleansing and Data Matching
Table 4.
Levels of Enabling and Disabling Data Matching and Cleansing Setting or Parameter Value Parameters Set By
Values set for Data Quality Settings NOTE: These settings affect all the servers. Administration - Data Quality, Data Quality Settings Enable DataCleansing Enable DeDuplication Values set at the object manager level Administration - Server Configuration, Servers, select component Data Quality Manager, then click the Parameters tab Data Cleansing Enable Flag Data Cleansing Type DedDuplication Enable Flag DeDuplication Data Type Administration - Server Configuration, Servers, select Object manager of application (for example, Sales Object Manager (ENU)), then click the Parameters tab Data Cleansing Enable Flag Data Cleansing Type DedDuplication Enable Flag DeDuplication Data Type Values set at the user level Tools, User Preferences, Data Quality Enable DataCleansing Enable DeDuplication Yes or No Data steward and end users NOTE: A data steward monitors the quality of incoming and outgoing data for an organization. True or False Data Cleansing Vendor Name True or False Data Matching Vendor Name True or False Data Cleansing Vendor Name True or False Data Matching Vendor Name Data administrator Data administrator Yes or No Data administrator
The values of parameters at the user level override the values at the object manager level. In turn, the values at the in the object manager level override the settings specified at the enterprise level. This allows administrators to enable data matching or cleansing for one application but not another and allows users to disable data matching or cleansing for their own login even if data matching or cleansing is enabled for their application.
34
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Enabling and Disabling Data Matching and Data Cleansing Enabling Data Quality at the Enterprise Level
However, data matching or data cleansing cannot be enabled for a user login if data matching or data cleansing are not enabled at the object manager level. Even if data cleansing and data matching are enabled, cleansing and matching are only triggered for business components as defined in Siebel Tools and in the Data Quality - Administration views. For more information, see the following topics: Enabling Data Quality at the Enterprise Level on page 35 Specifying Data Quality Settings on page 37 Enabling Data Quality at the Object Manager Level on page 39 Enabling Data Quality at the User Level on page 42
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
35
Enabling and Disabling Data Matching and Data Cleansing Enabling Data Quality at the Enterprise Level
In the Component Groups list, select Data Quality, and then click the Enable button. Data quality is now enabled at the enterprise level for data matching and data cleansing.
Use the following procedure to configure data matching and data cleansing settings at the enterprise level.
To configure data matching and data cleansing settings at the enterprise level 1 2 3 4
Log in to the Siebel application with administrator responsibilities. Navigate to the Administration - Server Configuration screen, then Enterprises view. Click the Parameters view tab. In the Parameter field in the Enterprise Parameters list, query and review the settings for each of the following parameters:
CHANGE_ME. Indicates that you chose None when you installed the Siebel Server. name of third-party server. Indicates the name of the third-party server that is being used for data matching and (or) data cleansing. For example:
ISS. Indicates that Oracle Data Quality Matching Server is used for data matching. ASM. Indicates that Oracle Data Quality Address Validation Server is used for data cleansing.
If necessary, enter any corrections in the Value field. The value you choose for Data Cleansing Type can differ from the value you choose for DeDuplication Data Type, provided that you have the appropriate vendor software available. NOTE: The values set in the Value field in the Enterprise Parameters list also appear in the Value fields for the corresponding parameters in the Component Parameters and Server Parameters views.
If you change an enterprise parameter in Step 4 (or if you change any value of a server component such as Data Quality Manager), restart the server component so that the new settings take effect. For more information about restarting server components, see Siebel System Administration Guide.
36
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Enabling and Disabling Data Matching and Data Cleansing Specifying Data Quality Settings
Log out of the application and log back in for the changes to take effect. NOTE: You do not have to restart the Siebel Server.
Table 5 describes the parameters that apply to all data quality products.
Table 5.
Data Quality Settings Applicable to Data Quality Products Description Determines whether real-time data cleansing is enabled for the Siebel Server the administrator is currently logged into. The default value is Yes. Other values you set for data quality can override this setting. For more information about this, see Levels of Enabling and Disabling Data Cleansing and Data Matching on page 33. Determines whether real-time data matching is enabled for the Siebel Server the administrator is currently logged into. The default value is Yes. Other values you set for data quality can override this setting. For more information about this, see Levels of Enabling and Disabling Data Cleansing and Data Matching on page 33. Determines whether duplicate records are displayed in a window when a user saves a new account record. The user can then merge duplicates. If set to No, duplicates are not displayed in a window, but the user can merge duplicates in the Duplicate Accounts view. The default value is Yes. For more information about window configuration, see Configuring the Windows Displayed in Real-Time Data Matching on page 56. Determines whether duplicate records are displayed in a window when a user saves a new contact record. The user can then merge duplicates. If set to No, duplicates are not displayed in a window, but the user can merge duplicates in the Duplicate Contacts view. The default value is Yes. For more information about window configuration, see Configuring the Windows Displayed in Real-Time Data Matching on page 56.
Enable DeDuplication
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
37
Enabling and Disabling Data Matching and Data Cleansing Specifying Data Quality Settings
Table 5.
Data Quality Settings Applicable to Data Quality Products Description Determines whether duplicate records are displayed in a window when a user saves a new prospect record. The user can then merge duplicates. If set to No, duplicates are not displayed in a window, but the user can merge duplicates in the Duplicate Prospects view. The default value is Yes. For more information about window configuration, see Configuring the Windows Displayed in Real-Time Data Matching on page 56. Determines whether fuzzy query, an advanced search feature, is enabled. The default value is no. For more information about fuzzy query, see Enabling and Disabling Fuzzy Query on page 43 Specifies the maximum number of records returned when a fuzzy query is performed. The default value is 500. For more information about fuzzy query, see Enabling and Disabling Fuzzy Query on page 43. If set to Primary Address, then only the primary address associated with an account is considered for deduplication. If set to All Address, then all addresses associated with an account are considered for deduplication. The default value is Primary Address. If set to Primary Address, then only the primary address associated with a contact is considered for deduplication. If set to All Address, then all addresses associated with a contact are considered for deduplication. The default value is Primary Address. Specifies a threshold above which any record with a match score is considered a match. Higher scores indicate closer matches. A perfect match is equal to 100. Possible values are: 50-100. Set to Yes if configuring deduplication against multiple addresses. The default value is No. For more information, see Configuring Deduplication Against Multiple Addresses on page 139. Set to Yes if configuring multiple language support for data matching. The default value is No. For more information, see Configuring Multiple Language Support for Data Matching on page 141.
Account Match Against NOTE: This parameter applies to the Oracle Data Quality Matching Server only. Contact Match Against NOTE: This parameter applies to the Oracle Data Quality Matching Server only. Match Threshold
Enable DQ Multiple Addresses NOTE: This parameter applies to the Oracle Data Quality Matching Server only. Enable DQ Multiple Languages NOTE: This parameter applies to the Oracle Data Quality Matching Server only.
38
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Enabling and Disabling Data Matching and Data Cleansing Enabling Data Quality at the Object Manager Level
Table 5.
Data Quality Settings Applicable to Data Quality Products Description Set to Yes if configuring data synchronization between the Siebel application and the ODQ Matching Server using the new synchronizer process. The new synchronizer process uses the DQ Sync Services business service to insert synchronized messages directly into the ODQ Matching Server (Informatica Identity Resolution) NSA table, and is triggered by the DQ Sync* action sets in Siebel CRM. The default value is Yes. For the new synchronizer process to work, you must also: Configure the EBC table. For more information, see Process of Configuring Data Synchronization Between Siebel and Oracle Data Quality Matching Server on page 120. Activate the DQ Sync Action Sets.
Parameter Enable DQ Sync NOTE: This parameter applies to the Oracle Data Quality Matching Server only.
NOTE: The old synchronizer uses workflows to send XML messages to the Oracle Data Quality Matching Server XS Server (XML Sync Server), and is triggered by the ISSSYNC action sets in Siebel CRM. Sort Match Web Service Results Set to Yes to enable the sort filter for the results in the Data Quality Web Services. The default is No.
Disabling Data Matching and Cleansing Without Restarting the Siebel Server
If you enabled data matching or cleansing from the Administration - Server Configuration screen, you can disable one or both from the Data Quality Settings view without restarting the Siebel Server. After you disable data matching or data cleansing, log out and then log in to the application again for the new settings to take effect. The settings apply to all the object managers in your Siebel Server, whether or not they have been enabled in the Administration - Server Configuration screen.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
39
Enabling and Disabling Data Matching and Data Cleansing Enabling Data Quality at the Object Manager Level
To enable data matching and data cleansing for real-time processing at the object manager level, you must enable certain parameters for the object manager that the application uses. You enable real-time processing for data matching and cleansing using either the graphical user interface (GUI) of the Siebel application or the command-line interface of the Siebel Server Manager. NOTE: The command-line interface of the Siebel Server Manager is the srvrmgr program. For more information about using the command-line interface, see Siebel System Administration Guide. Use the following procedures to enable data matching and cleansing for real-time processing: Enabling Data Quality Using the GUI on page 40 Enabling Data Quality Using the Command-Line Interface on page 41
These procedures require that data quality is already enabled at the enterprise level. For information about enabling data quality at the enterprise level, see Enabling Data Quality at the Enterprise Level on page 35.
To enable data quality at the object manager level using the GUI 1 2 3
Log in to the Siebel application with administrator responsibilities. Navigate to the Administration - Server Configuration screen, then the Servers view. In the Components list, select an object manager where end users enter and modify customer data. For example, select the Call Center Object Manager (ENU) if you want to enable or disable realtime data matching or cleansing for that object manager.
4 5
Click the Parameters subview tab. In the Parameters field in the Component Parameters list, apply the appropriate settings to the parameters listed in the following table to enable or disable data matching or cleansing. Field Data Cleansing Enable Flag Description Indicates whether real-time data cleansing is enabled for a specific object manager, such as Call Center Object Manager (ENU). This parameter allows you to set different data cleansing values in different object managers. By default, all values for this parameter are set to False. Indicates whether real-time data matching is enabled for a specific object manager, such as Call Center Object Manager (ENU). This parameter allows you to set different data matching values in different object managers. By default, all values for this parameter are set to False.
40
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Enabling and Disabling Data Matching and Data Cleansing Enabling Data Quality at the Object Manager Level
Description Indicates the third-party vendor software that is used for data cleansing. Indicates the third-party vendor software that is used for data matching.
NOTE: The settings at this object manager level override the enterprise-level settings.
After the component parameters are set, restart the object manager either by using srvrmgr or by completing the following sub-steps:
a b c d
Navigate to the Administration - Server Management screen, then the Servers view. Click the Components Groups view tab (if not already active). In the Servers list (upper applet), select the appropriate Siebel Server (if you have more than one in your enterprise). In the Components Groups list (middle applet), select the component of your object manager, and use the Startup and Shutdown buttons to restart the component.
For information about restarting server components, see Siebel System Administration Guide.
To enable data quality at the object manager level using the Siebel Server Manager command-line interface 1
Start the Siebel Server Manager command-line interface (srvrmgr) using the user name and password of a Siebel application administrator account such as SADMIN. For more information, see Siebel System Administration Guide. NOTE: You must have Siebel CRM administrator responsibility to start or run Siebel Server tasks using the Siebel Server Manager command-line interface.
Execute commands similar to the following examples to enable or disable data matching or data cleansing. The examples are for the Call Center English application (where SSCObjmgr_enu is the alias name of the English Call Center object manager of the Call Center application.) Use the appropriate alias_name for the application component name to which you want the change applied:
To enable data matching if you are using Universal Connector third-party software: change parameter DedDupTypeEnable=True, DeDupTypeType=ISS for component SCCObjMgr_enu
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
41
Enabling and Disabling Data Matching and Data Cleansing Enabling Data Quality at the User Level
To enable data cleansing if you are using Universal Connector third-party software: change parameter DataCleansingEnable=True, DataCleansingType=ASM for component SCCObjMgr_enu
To disable data matching or data cleansing, execute commands like these examples with the DeDupTypeEnable or DataCleansingEnable parameters set to False. For more information about using the command-line interface, see Siebel System Administration Guide.
42
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Enabling and Disabling Data Matching and Data Cleansing Disabling Data Cleansing for Specific Records
Description Applicable for Universal Connector. Select a threshold above which any record with a match score is considered a match. Higher scores indicate closer matches (a perfect match is equal to 100). Possible values are: 50-100. If no threshold value is supplied in any of the data quality settings, the default value of 50 is used by the Siebel application.
Log out of the application and log back in as the user to initialize the new settings. Do not to restart the Siebel Server.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
43
Enabling and Disabling Data Matching and Data Cleansing Enabling and Disabling Fuzzy Query
Fuzzy query must be enabled for the current user in the User Preferences - Data Quality view; Fuzzy Query Enabled must be set to Yes. The query must not use wildcards. The query must specify values in fields designated as fuzzy query mandatory fields. For information about identifying the mandatory fields, see Identifying Mandatory Fields for Fuzzy Query on page 45. The query must leave optional fields blank.
The following procedures describe how to enable and disable fuzzy query in the Data Quality Settings. If wildcards (*) or quotation marks (") are used in a fuzzy query, then that fuzzy query will not be effective. Also, if mandatory fuzzy query fields are missing, then fuzzy query is disabled for that particular query.
a b 3
In the Name field, choose Fuzzy Query Enabled. In the Value field, choose Yes.
(Optional) If you want to set a maximum number of returned records, click New to create a new record:
a b
In the Name field, choose Fuzzy Query - Max Returned. In the Value field, enter a number from 10 to 500.
Related Topics Using Fuzzy Query on page 222 Example of Enabling and Using Fuzzy Query with Accounts on page 223
44
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Enabling and Disabling Data Matching and Data Cleansing Identifying Mandatory Fields for Fuzzy Query
Table 6.
Mandatory Fields for Fuzzy Query, by Business Component Mandatory Fields for Fuzzy Query Name First Name, Last Name First Name, Last Name
In the Business Component User Properties pane, select Fuzzy Query Mandatory Fields, and inspect the field names listed in the Value column. Repeat Step 2 through Step 4 for other business components, as needed.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
45
Enabling and Disabling Data Matching and Data Cleansing Identifying Mandatory Fields for Fuzzy Query
46
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
This chapter describes the data quality configuration that you can perform for Siebel CRM and Oracle Customer Hub. It covers the following topics: Data Quality Configuration Overview on page 48 Installing the Universal Connector on page 49 Process of Configuring New Data Quality Connectors on page 50 Configuring Vendor Parameters on page 53 Mapping of Vendor Fields to Business Component Fields on page 53 Configuring the Windows Displayed in Real-Time Data Matching on page 56 Configuring the Mandatory Fields for Fuzzy Query on page 58 Data Quality User Properties on page 58 Optimizing Data Cleansing Performance on page 67 Optimizing Data Matching Performance on page 68
NOTE: You must be familiar with Siebel Tools before performing some of the data quality configuration tasks. For more information about Siebel Tools, see Using Siebel Tools and Configuring Siebel Business Applications.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
47
Configuring Data Quality with Universal Connector Data Quality Configuration Overview
Table 7. Type
Data Quality Configuration Options Configuration Configure new connectors for data matching and data cleansing for the Universal Connector Configure vendor parameters. You can configure the parameters for each of the software vendors. Configure field mappings for business components. You can change or add field mappings. Configure the windows displayed in real-time data matching Configure the mandatory fields for fuzzy search. See... Process of Configuring New Data Quality Connectors on page 50 Configuring Vendor Parameters on page 53 Mapping of Vendor Fields to Business Component Fields on page 53 Configuring the Windows Displayed in Real-Time Data Matching on page 56 Configuring the Mandatory Fields for Fuzzy Query on page 58 Process of Configuring Oracle Data Quality Matching Server (IIR) on page 110 Configuring a New Field for RealTime Data Matching on page 115 Incremental Data Load on page 118
Data Matching
Data Cleansing
Configuring Siebel Business Applications for the Oracle Data Quality Address Validation Server on page 155
48
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Universal Connector Installing the Universal Connector
Installing Third-Party Application Software for Use with the Universal Connector
Unlike most other third-party software, you must install third-party software for use with the Universal Connector after you install Siebel Business Applications. Install the third-party software in the SDQConnector directory where your Siebel Business Applications are installed; that is, the Siebel_Server_root/SDQConnector directory. See the documentation provided by the third-party vendor for instructions.
Installing Third-Party Data Cleansing Files for Use with the Universal Connector
To perform data cleansing, the third-party vendor software usually needs a set of files for standardization and data cleansing. For information about specifying the location of such files, see the documentation provided by the third-party vendor.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
49
Configuring Data Quality with Universal Connector Process of Configuring New Data Quality Connectors
Table 8.
Storage Locations for Universal Connector Library Files by Operating System DLL Storage Locations (Windows) For Siebel Server: Shared Library Storage Locations (UNIX) For Siebel Server:
Siebel_Server_root\bin\
For Developer Web Client:
Siebel_Server_root/lib
Client_root\bin\
Yes For Siebel Server: For Siebel Server:
Siebel_Server_root\bin\langua ge_code
For Developer Web Client:
Siebel_Server_root/lib/ language_code
Client_root\bin\language_code
where language_code is the appropriate language code, such as ENU for U.S. English. NOTE: The DLLs or shared libraries for each vendor can be specific to certain operating systems or external product versions, so it is important that you confirm with your vendor that you have the correct files installed on your Siebel Server. The Universal Connector requires that you install third-party applications on each Siebel Server that has the object managers enabled for data quality functionality. If you plan to test real-time mode using a Siebel Developer Web Client, you must install the third-party Data Quality software on that computer, as well. NOTE: When installing data quality products using the Data Quality Applications product media pack on Oracle Software Delivery Cloud, the DLL or shared library files are copied to a location that depends on the operating system you are using.
1 2
Registering New Data Quality Connectors on page 51 Configuring Business Components and Applets for Data Matching and Data Cleansing on page 51
NOTE: These processes do not cover vendor-specific configuration. You must work with Oracle-certified alliance partners to enhance data quality features for your applications.
50
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Universal Connector Process of Configuring New Data Quality Connectors
Configuring Business Components and Applets for Data Matching and Data Cleansing
This topic describes how to configure business components and applets, whether existing ones or new ones you create, for data matching and data cleansing.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
51
Configuring Data Quality with Universal Connector Process of Configuring New Data Quality Connectors
You can configure existing business components or create additional business components for data matching for the Matching Server and for data matching and data cleansing for the Universal Connector. Typically, you configure existing business components; however, you can create your own business components to associate with connector definitions. For information about how to create new business components and define user properties for those components, see Configuring Siebel Business Applications. NOTE: You must base new business components you create only on the CSSBCBase class to support data cleansing and data matching, or make sure that the business component uses a class whose parent is CSSBCBase. This class includes the specific logic to call the DeDuplication and Data Cleansing business services. To configure business components for data matching and data cleansing, complete the steps in the following procedure. This topic is a step in Process of Configuring New Data Quality Connectors on page 50.
Parameter 1
2 3
Configure the field mappings for each business component and operation. Create a DeDuplication Results business component and add it to the Deduplication business object.
52
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
4 5 6
Configure an applet as the DeDuplication Results List Applet. Configure Duplicate views and add them to the Administration - Data Quality screen. Add the business component user properties as shown in the following table. Property DeDuplication Results BusComp DeDuplication Results List Applet Value The buscomp that you created in Step 3 on page 52. The applet that you created in Step 4 on page 53.
Add a field called Merge Sequence Number to the business component and a user property called Merge Sequence Number Field.
There are mappings for each supported business component and data quality operation (DeDuplication and Data Cleansing). There are preconfigured field mappings for the Oracle Data Quality Matching Server Connector, see Preconfigured Field Mappings for Oracle Data Quality Matching Server on page 112.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
53
Configuring Data Quality with Universal Connector Mapping of Vendor Fields to Business Component Fields
You can configure the field mappings for a business component to include new fields or modify them to map to different fields. There might also be additional configuration required for particular thirdparty software. NOTE: You must contact the specific vendor for the list of fields they support for data cleansing and data matching and to understand the effect of changing field mappings.
Related Topics Mapping Data Matching Vendor Fields to Siebel Business Components on page 54 Adding a Field Mapping for Data Matching on page 54 Mapping Data Cleansing Vendor Fields to Siebel Business Component Fields on page 55
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Universal Connector Mapping of Vendor Fields to Business Component Fields
3 4
Click the BC Vendor Field Mapping view tab. In the BC Operation list, select the record for the required business component and operation
For example, to include a date of birth as a matching criterion, select the record for Contact and DeDuplication. For example, to include a D-U-N-S number as a matching criterion, select the record for Account and DeDuplication.
In the Field Mapping list, create a new record and complete the necessary fields as in the example shown in the following table. Business Component Field Birth Date DUNS Number Mapped Field Contact.Birth Date Account.DUNS Number
If required, modify the corresponding real-time and batch mode data flows to incorporate the new field so that data quality considers the new field during data matching comparisons.
TIP: Only fields that are preconfigured as data cleansing fields in the vendor properties trigger realtime data cleansing when they are modified.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
55
Configuring Data Quality with Universal Connector Configuring the Windows Displayed in Real-Time Data Matching
2 3 4
In the Vendor List, select the record for the required vendor. Click the BC Vendor Field Mapping view tab. In the BC Operation list, select the record for the required business component and Data Cleansing operation. The field mappings are displayed in the Field Mapping list.
In the Field Mapping list enter the required values for Business Component Field and Mapped Field.
56
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Universal Connector Configuring the Windows Displayed in Real-Time Data Matching
Name. DeDuplication Results Applet Value. DeDuplication Results (Account) List Applet
5 6
To configure the real-time Deduplication Window for a child applet (Account Contact view) 1
In Siebel Tools, query for the following applet: Account Contact List Applet
Name. DeDuplication Results Applet Value. DeDuplication Results (Contact) List Applet
3 4
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
57
Configuring Data Quality with Universal Connector Configuring the Mandatory Fields for Fuzzy Query
In the Business Component User Properties pane, select Fuzzy Query Mandatory Fields, and enter the required field names in the Value column.
58
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Universal Connector Data Quality User Properties
Table 9. Name
Account - Business Component User Properties Value DQ Vendor DQ Field Mapping Info DQ Mapping Config DQ Vendor Info Account Key Comment These business component user properties for Account specify the business object and business component used in the data quality Administration views in Siebel business application. For the Universal Connector, DQ Key BusComp is used to specify the Name of the buscomp that stored the deduplication key generated in Siebel. For SDQ Matching Server, DeDuplication Key BusComp is used to specify the Name of the buscomp that stored the dedup key generated by SSA. Specifies the Name of the buscomp that will store the returned duplicated record data. Specifies the Name of the pick applet used to prompt the user to resolve duplicates. Specifies the mandatory fields for Fuzzy Query; that is, the query fields that must include values so that the Siebel application can use the fuzzy query mode. Specifies the Name of the child MVG buscomp, and the field in the parent buscomp that comes from this MVG. This business component applies to the data quality Multiple Address Deduplication feature. NOTE: MVG stands for multiple value group.
DQ Business Object DQ Field Mapping Info BC DQ Mapping Config BC DQ Vendor Info BC DQ Key BusComp
DeDuplication Results BusComp DeDuplication Results List Applet Fuzzy Query Mandatory Fields
DQ Associate BC 1
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
59
Configuring Data Quality with Universal Connector Data Quality User Properties
For each field used in Multiple Address Deduplication that comes from the child MVG buscomp, a field user property is specified to map it to the child business component field, as shown in Table 10.
Account Business Component - Field User Properties for Multiple Address Deduplication Name DQ Associate BC Map DQ Associate BC Map DQ Associate BC Map DQ Associate BC Map DQ Associate BC Map Value DQ Associate BC 1: City DQ Associate BC 1: Country DQ Associate BC 1: Postal Code DQ Associate BC 1: State DQ Associate BC 1: Street Address
Primary Account City Primary Account Country Primary Account Postal Code Primary Account State Primary Account Street Address
Contact - Business Component User Properties Value DQ Vendor DQ Field Mapping Info DQ Mapping Config DQ Vendor Info Contact Key Comment These business component user properties for Contact specify the business object and business component used in the data quality Administration views in Siebel business application. For the Universal Connector, DQ Key BusComp is used to specify the Name of the buscomp that stored the deduplication key generated in Siebel. For SDQ Matching Server, DeDuplication Key BusComp is used to specify the Name of the buscomp that stored the dedup key generated by SSA. Specifies the Name of the buscomp that will store the returned duplicated record data. Specifies the Name of the pick applet used to prompt the user to resolve duplicates.
DQ Business Object DQ Field Mapping Info BC DQ Mapping Config BC DQ Vendor Info BC DQ Key BusComp
60
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Universal Connector Data Quality User Properties
Contact - Business Component User Properties Value "Last Name", "First Name" Comment Specifies the mandatory fields for Fuzzy Query; that is, the query fields that must include values so that the Siebel application can use the fuzzy query mode. Specifies the Name of the child MVG buscomp, and the field in the parent buscomp that comes from this MVG. This business component applies to the data quality Multiple Address Deduplication feature. NOTE: MVG stands for multiple value group.
DQ Associate BC 1
For each field used in Multiple Address Deduplication that comes from the child MVG buscomp, a field user property is specified to map it to the child business component field, as shown in Table 12.
Contact Business Component - Field User Properties for Multiple Address Deduplication Name DQ Associate BC Map DQ Associate BC Map DQ Associate BC Map DQ Associate BC Map DQ Associate BC Map Value DQ Associate BC 1: City DQ Associate BC 1: Country DQ Associate BC 1: Postal Code DQ Associate BC 1: State DQ Associate BC 1: Street Address
Primary Personal City Primary Personal Country Primary Personal Postal Code Primary Personal State Primary Personal Street Address
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
61
Configuring Data Quality with Universal Connector Data Quality User Properties
List Mgmt Prospective Contact - Business Component User Properties Value DQ Vendor DQ Field Mapping Info DQ Mapping Config DQ Vendor Info Prospect Key Comment These business component user properties for List Mgmt Prospective Contact specify the business object and business component used in the data quality Administration views in Siebel business application. For the Universal Connector, DQ Key BusComp is used to specify the Name of the buscomp that stored the deduplication key generated in Siebel. For SDQ Matching Server, DQ Key BusComp is used to specify the Name of the buscomp that stored the dedup key generated by SSA. Specifies the Name of the buscomp that will store the returned duplicated record data. Specifies the Name of the pick applet used to prompt the user to resolve duplicates. Specifies the mandatory fields for Fuzzy Query; that is, the query fields that must include values so that the Siebel application can use the fuzzy query mode.
DQ Business Object DQ Field Mapping Info BC DQ Mapping Config BC DQ Vendor Info BC DQ Key BusComp
DeDuplication Results BusComp DeDuplication Results List Applet Fuzzy Query Mandatory Fields
DeDuplication Results (Prospect) DeDuplication Results (Prospect) List Applet "Last Name", "First Name"
62
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Universal Connector Data Quality User Properties
DeDuplication - Business Service User Properties Value DQ Vendor DQ Vendor Info DQ Mapping Config DQ Field Mapping Info DQ Vendor Parameter Comment These business service user properties for DeDuplication specify the business object and business component used in the data quality Administration views in Siebel business application.
Data Cleansing - Business Service User Properties Value DQ Vendor DQ Vendor Info DQ Mapping Config DQ Field Mapping Info DQ Vendor Parameter Comment These business service user properties for Data Cleansing specify the business object and business component used in the data quality Administration views in Siebel business application.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
63
Configuring Data Quality with Universal Connector Data Quality User Properties
Data quality uses the DQ Sync Services business service user properties listed in Table 16.
DQ Sync Services - Business Service User Properties Value SyncAccount Account_Business Address SyncContact Contact_INS Personal Address SyncProspect siebeldq IDS_01_IDT_ACCOUNT IDS_01_IDT_CONTACT IDS_01_IDT_PROSPECT 1090 Name|DUNS Number|Id W|W|C 200|120|30 1850 Birth Date|Cellular Phone #|Email Address|First Name Last Name|Home Phone #|Middle Name|Account|Id|Social Security Number|Work Phone # W|W|W|W|W|W|W|C|W|W 120|120|120|200|120|200|200|30| 120|120 These business service user properties specify the Contact record fields, data type, and length. These business service user properties specify the default Informatica Identity Resolution system name and IDT table names. These business service user properties specify the Account record fields, data type, and length. Comment These business service user properties specify the Integration Objects and Components used by the Siebel application to synchronize with Oracle Data Quality Matching Server (Informatica Identity Resolution).
Account Integration Object Name Account Address Component Name Contact Integration Object Name Contact Address Component Name Prospect Integration Object Name ISS System Name Account IDT Name Contact IDT Name Prospect IDT Name Account Record Length Account_DeDupFlds Account_DataType Account_ExtLen Contact Record Length Contact_DeDupFlds
Contact_DataType Contact_ExtLen
64
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Universal Connector Data Quality User Properties
DQ Sync Services - Business Service User Properties Value 1770 Account|Cellular Phone #|City|Country|Email Address|First Name Last Name|Home Phone #|Middle Name|Postal Code|Social Security Number|State|Street Address|Work Phone #|Id W|W|W|W|W|W|W|W|W|W|W|W|W |C 200|120|120|60|120|200|120|200| 40|120|40|200|200|30 City|Country|Postal Code|State|Street Address|Address Id W|W|W|W|W|C 200|120|120|40|200|60 INS Personal City|INS Personal Country|INS Personal Postal Code|INS Personal State|INS Personal Street Address|INS Personal Address Id W|W|W|W|W|C 120|40|40|40|200|60 These business service user properties specify the Contact INS Personal Address record fields, data type, and length. These business service user properties specify the Account Business Address record fields, data type, and length. Comment These business service user properties specify the Prospect record fields, data type, and length.
Prospect_DataType Prospect_ExtLen Account_Business Address_DeDupFlds Account_Business Address_DataType Account_Business Address_ExtLen Contact_INS Personal Address_DeDupFlds
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
65
Configuring Data Quality with Universal Connector Data Quality User Properties
DQ Sync Services - Business Service User Properties Value <Enter characters separated by a single space> NOTE: The following special characters are removed automatically from all data (so do not specify them in the DQ Sync Services Business Service user property): ^ (code separator) | (code separator) \r (carriage return) \n (new line) \t (horizontal tab) \f (form feed) Comment This business service user property is used to specify any special characters that need to be removed from data sent to the Oracle Data Quality Matching Server. NOTE: Enter all filter characters separated by a single space. For control characters, enter the ASCII code in the following format: ^nnn (where nnn is the code in decimal). This business service user property is used to specify the Endian of the Operating System where the Oracle Data Quality Matching Server is installed.
Filter Characters
Yes
ISS System Services - Business Service User Properties Value siebeldq siebeldq siebeldq Comment These business service user properties specify the default Informatica Identity Resolution system name for each support object.
66
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Universal Connector Optimizing Data Cleansing Performance
ISS System Services - Business Service User Properties Value 01 02 03 Comment These are sample business service user properties. You must enter the actual Informatica Identity Resolution system name and corresponding database number. The default system has a database number of 01.
siebeldq_Denmark siebeldq_UNITED_STATES
Related Topic Configuring Multiple Language Support for Data Matching on page 141
Table 18.
Recommended Data Cleansing Object WHERE Clause Solutions Use This in Your Object WHERE Clause [Last Clnse Date] < [Updated] [Last Clnse Date] IS NULL [Last Clnse Date] < [Updated] OR [Last Clnse Date] IS NULL
To speed up the data cleansing task for large databases, run batch jobs to cleanse a smaller number of records at a time using an Object WHERE clause. For more information about data cleansing for large batches, see Cleansing Data Using Batch Jobs on page 212.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
67
Configuring Data Quality with Universal Connector Optimizing Data Matching Performance
If the base tables contain many duplicates, more records are inserted in the results table. If different search types are used, a different set of duplicate records might be found and will be inserted into the results table. If you use a low match threshold, the matching process generates more records to the results table.
Remove obsolete result records manually from the S_DEDUP_RESULT table by running SQL statements directly on this table. When a duplicate record is detected, the information about the duplicate is automatically placed in the S_DEDUP_RESULT table, whether or not the same information exists in that table. Running multiple batch data matching tasks therefore results in a large number of duplicate records in the table. Therefore, it is recommended that you manually remove the existing records in the S_DEDUP_RESULT table before running a new batch data matching task. You can remove the records using any utility that allows you to submit SQL statements. NOTE: When truncating the S_DEDUP_RESULT table, all potential duplicate records found for all data matching business components are deleted.
Match performance is slow when the SOH string is set as follows: S_POSTN_CON.CON_LAST_NAME = 'SOH' Setting the following values improves match performance:
Navigate to the Contact Business Component. Add the following user property and value under the Last Name field: User Property Sort Search Optimization Value FALSE
For more information about running batch data matching, see Matching Data Using Batch Jobs on page 213.
68
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
This chapter describes the configuration tasks that you can perform for Oracle Enterprise Data Quality. It covers the following topics: Overview of Oracle Enterprise Data Quality on page 69 Installing Oracle Data Enterprise Data Quality on page 69 Process of Configuring Oracle Enterprise Data Quality for Matching and Address Verification on page 70 Process of Configuring the Staging Database for Oracle Enterprise Data Quality on page 75 Vendor Parameters for Oracle Enterprise Data Quality on page 76 Oracle Enterprise Data Quality Field Mappings for Siebel Business Components on page 77 Configuring Third-Party Key Generation for Oracle Enterprise Data Quality on page 85 Enabling Support of Multiple Child Entity Deduplication for Oracle Enterprise Data Quality on page 85 Data Format for Deduplication of Multiple Child Entities on page 85
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
69
Configuring Data Quality with Oracle Enterprise Data Quality Process of Configuring Oracle Enterprise Data Quality for Matching and Address Verification
Prerequisites
Oracle Enterprise Data Quality Server following the Oracle Enterprise Data Quality Installation Notes from Datanomic. For installation instructions, http://www.oracle.com/technetwork/ middleware/oedq/documentation/index.html. Oracle Enterprise Data Quality Customer Data Services Pack must be installed following the instruction from Oracle Enterprise Data Quality Installation Guide. Initializing Reference Data following the instruction from Oracle Enterprise Data Quality Customer Data Services Pack Installation Guide. Optionally, installing Oracle Enterprise Data Quality Address Verification. This is required if you want to run the Address Clean process with Oracle Enterprise Data Quality Address Verification. Oracle Enterprise Data Quality-Siebel Connector. The Java Runtime Environment (JRE) version 6 or later must be installed on the Siebel server. The version of JRE deployed must match the architecture of the server supported by Siebel CRM. For example, as Siebel supports 32-bit versions of Windows, the 32-bit version of JRE must be installed.
Process of Configuring Oracle Enterprise Data Quality for Matching and Address Verification
Siebel Connector calls Oracle Enterprise Data Quality jobs and Web services for real-time data cleansing and matching. A shared staging database is used to pass data between the Siebel CRM application, and Oracle Enterprise Data Quality when batch jobs are run. This process includes the following tasks: Configuring the Siebel Server Under Windows on page 70 Configuring the Siebel Server Under UNIX on page 71 Configuring the Siebel Server to Use Oracle Enterprise Data Quality on page 72 Configuring a New Data Quality Vendor on page 73
70
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Enterprise Data Quality Process of Configuring Oracle Enterprise Data Quality for Matching and Address Verification
Create a file in the Siebel_Server_root\SDQConnector\ directory with the name dnd.parms. This file is used to point to the installation directory on the same server where the Siebel connector runs. The file must contain the following lines: javalib: [location of the JRE]/bin/client/jvm.dll directory: [the installation directory for the connector] For example: javalib: C:/Program Files/Java/jre1.6/bin/client/jvm.dll directory: C:/SiebelConnector NOTE: The Siebel connector is not language specific, so is not installed in a specific language directory.
If detailed trace information on connector requests is required, add the following lines to the dnd.parms file to enable logging: logfile: sdq%05d.log jlogfile: jsdq%05d.log
Extract the siebelconnector.zip file into the installation directory specified in the dnd.parms file (the default location is: Siebel_Server_root\dnd\install.) This installs the following files:
Jar files for the connector. Library jar files needed by the connector. Properties files used to store configuration information for the Siebel Connector.
Extract and copy the dnd.properties file from the Oracle Enterprise Data Quality Siebel Connector 9.0.4.zip file to the identically named file in the installation directory.
AIX: /native/aix/ppc/ or /native/ppc64, depending on whether a 32-bit or 64-bit version of AIX is in use. Linux: /native/linux Solaris: /native/sparcv9
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
71
Configuring Data Quality with Oracle Enterprise Data Quality Process of Configuring Oracle Enterprise Data Quality for Matching and Address Verification
Create a file in the Siebel_Server_root/SDQConnector directory with the name dnd.parms. This file will be used to point to the installation directory on the same server where the Siebel connector will run. The file must contain the following lines: javalib: [location of the JRE]/lib/i386/client/libjvm.so directory: [the installation directory for the connector] For example: javalib: /usr/java/jre1.6/lib/i386/client/libjvm.so directory: /opt/siebel/dnd/install
If detailed trace information on connector requests is required, add the following lines to the dnd.parms file to enable logging: logfile: sdq%05d.log jlogfile: jsdq%05d.log
Extract the siebelconnector.zip file into the installation directory specified in the dnd.parms file (the default being /opt/siebel/dnd/install). This installs the following files:
Jar files for the connector Library jar files needed by the connector Properties files used to store configuration information for the Siebel Connector
Extract and copy the dnd.properties file in the EDQ-CDS.zip file over the identically named file in the installation directory.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Enterprise Data Quality Process of Configuring Oracle Enterprise Data Quality for Matching and Address Verification
8 9
Click the Servers link, and locate the Data Quality Manager component, then click the Parameters tab. Make sure the following parameter values are set to EDQ:
10 In the Component Parameters view, make sure the following parameters are set to True:
11 Find the component for the Siebel application which is to use EDQ-CDS, such as the UCM Object
Manager (ENU).
12 Repeat Step 5 through Step 11 for any other components that you want to enable for EDQ-CDS.
Add the following options to enable a pop-up dialog box to display possible duplicate records for each new record of a given type when the real-time duplicate prevention service is running. If these options are not specified, and the real-time duplicate prevention service is running, the check will still be performed, but you must navigate to the Duplicate Accounts, Duplicate Contacts or Duplicate Prospects tab when adding a new record:
Force User DeDupe - Account = Yes Force User DeDupe - Contact = Yes Force User DeDupe - List Mgmt = Yes
Click the Third Party Administration link, and in the Vendor list add a new entry with EDQ as the Name value, and dnd at the DLL Name value.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
73
Configuring Data Quality with Oracle Enterprise Data Quality Process of Configuring Oracle Enterprise Data Quality for Matching and Address Verification
With the newly created vendor record selected, click the BC Vendor Field Mapping tab the enter the following information: Business Component Name Account Account Contact Contact CUT Address List Mgmt Prospective Contact List Mgmt Prospective Contact Personal Address Operation Data Cleansing DeDuplication Data Cleansing DeDuplication Data Cleansing Data Cleansing DeDuplication Data Cleansing
If multiple child entities are enabled, add the following entries: Business Component Name CUT Address Personal Address Operation DeDuplication DeDuplication
In the Field mappings list, set the field mappings for each business component you entered in the BC Operation list. For more information, see Mapping of Vendor Fields to Business Component Fields on page 53. NOTE: Field mappings can be added or changed as required, provided that they correspond to the configured Web Services in Oracle Enterprise Data Quality (for real-time operations), and the names of the columns in the tables of the staging database (for batch operations).
In the Vendor screen, make sure the EDQ vendor is still selected, then click the Vendor Parameter tab.
74
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Enterprise Data Quality Process of Configuring the Staging Database for Oracle Enterprise Data Quality
Process of Configuring the Staging Database for Oracle Enterprise Data Quality
The staging database is used by the Siebel Connector as a staging area for batch matching jobs on the master data. NOTE: In Oracle Customer Hub (UCM), batch matching is normally performed using the Oracle Enterprise Data Quality real-time matching services, so that all connected source systems are considered during matching. However, for both Siebel CRM and Oracle Customer Hub (UCM) the Siebel Data Quality Manager service can run batch jobs on the master data only. Candidates for matching are written to a table to be read by an Oracle Enterprise Data Quality job. Oracle Enterprise Data Quality then exports duplicates from the matching process to a table to be read back into the Siebel database. This process includes the following tasks: Creating Tables on page 75 Configuring Connections on page 75
Creating Tables
This task is a step in Process of Configuring the Staging Database for Oracle Enterprise Data Quality on page 75. The sql folder in the root of the EDQ-CDS distribution contains six scripts that can be run to create tables as follows: siebel_account_candidates.sql creates the SIEBEL_ACCOUNT_CANDIDATES table. siebel_account_matches.sql creates the SIEBEL_ACCOUNT_MATCHES table. siebel_contact_candidates.sql creates the SIEBEL_CONTACT_CANDIDATES table. siebel_contact_matches.sql creates the SIEBEL_CONTACT_MATCHES table. siebel_prospect_candidates.sql creates the SIEBEL_PROSPECT_CANDIDATES table siebel_prospect_matches.sql creates the SIEBEL_PROSPECT_MATCHES table
Configuring Connections
This task is a step in Process of Configuring the Staging Database for Oracle Enterprise Data Quality on page 75. Both the Siebel Connector and EDQ-CDS itself must connect to the staging database to read and write to the Candidate and Match tables when processing batch jobs. These tables can be created in any database and schema, but by default the connection is configured for an Oracle database.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
75
Configuring Data Quality with Oracle Enterprise Data Quality Vendor Parameters for Oracle Enterprise Data Quality
The connection details are specified in the dnd.properties file, see Process of Configuring Oracle Enterprise Data Quality for Matching and Address Verification on page 70. To configure these, open the dnd.properties file and edit the six parameters in the Staging database connection details section near the top of the file. These parameters control the database host, port, credentials and other settings used to connect to the Staging Database.
Account DataCleanse Record Type Account DeDup Record Type CUT Address DataCleanse Record Type CUT Address DeDup Record Type Personal Address DataCleanse Record Type Personal Address DeDup Record Type
76
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Enterprise Data Quality Oracle Enterprise Data Quality Field Mappings for Siebel Business Components
Parameters for enabling third-party vendors in Oracle Enterprise Data Quality Value contactclean. contactmatch. prospectclean.
Contact DataCleanse Record Type Contact DeDup Record Type List Mgmt Prospective Contact DataCleanse Record Type List Mgmt Prospective Contact DeDup Record Type Batch Max Num of Records Parameter 1
prospectmatch.
200 "session", "clusterlevel", "3". NOTE: There must be a space after each comma.
Parameter 2
"session", "matchthreshold", "50". NOTE: There must be a space after each comma.
Oracle Enterprise Data Quality Field Mappings for Siebel Business Components
The following field mappings work with the provided templates for data quality services, and the provided example SQL for creating the staging database. Note the following: All fields in these business components are drop-down fields, with the exception of Id. This field must be completed manually. If Oracle Enterprise Data Quality Address Verification is used for address cleaning:
The State field mapping only functions correctly if all addresses in the Siebel database are US addresses. If they are not, do not map this field. The Country field mapping may not function correctly if the Siebel CRM field is restricted to fixed list of values that do not exactly match those returned by the cleaning process.
This topic contains information on the following field mappings: Account - Data Cleansing on page 78 Account - DeDuplication on page 78 CUT Address - Data Cleansing on page 79
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
77
Configuring Data Quality with Oracle Enterprise Data Quality Oracle Enterprise Data Quality Field Mappings for Siebel Business Components
CUT Address - DeDuplication on page 80 Personal Address - Data Cleansing on page 80 Personal Address - DeDuplication on page 81 Contact - Data Cleansing on page 81 Contact - DeDuplication on page 82 List Mgmt Prospective Contact - Data Cleansing on page 83 List Mgmt Prospective Contact - DeDuplication on page 84
Table 20.
Mapped fields for Account data cleansing Mapped Field entityid languages name subname phone taxnumber website vatnumber
Business Component Field Id Language Code Name Location Main Phone Number Tax ID Number Home Page VAT registration number
Account - DeDuplication
Table 21 lists the mapped fields for deduplication of the Account business component.
Table 21.
Mapped fields for Account deduplication Mapped Field entityid languages name subname phone
Business Component Field Id Language Code Name Location Main Phone Number
78
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Enterprise Data Quality Oracle Enterprise Data Quality Field Mappings for Siebel Business Components
Table 21.
Mapped fields for Account deduplication Mapped Field taxnumber vatnumber website
Business Component Field Tax ID Number VAT registration number Home Page
Table 22 lists the mapped fields for deduplication of the Account business component when the Multiple Child Entities parameter is disabled.
Table 22.
Mapped fields for Account deduplication when Multiple Child Entities is disabled Mapped Field address1 address2 address3 city adminarea postalcode country
Business Component Field Primary Account Street Address Primary Account Address Street Address2 Primary Account Address Street Address3 Primary Account City Primary Account State Primary Account Postal Code Primary Account Country
Table 23.
Mapped fields for CUT Address data cleansing Mapped Field address1 address2 city adminarea postalcode country
Business Component Field Street Address Street Address 2 City State Postal Code Country
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
79
Configuring Data Quality with Oracle Enterprise Data Quality Oracle Enterprise Data Quality Field Mappings for Siebel Business Components
Table 24.
Mapped fields for CUT Address deduplication Mapped Field address1 address2 city adminarea postalcode country
Business Component Field Street Address Street Address 2 City State Postal Code Country
Table 25.
Mapped fields for Personal Address data cleansing Mapped Field address1 address2 city adminarea postalcode country
Business Component Field Street Address Street Address 2 City State Postal Code Country
80
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Enterprise Data Quality Oracle Enterprise Data Quality Field Mappings for Siebel Business Components
Table 26.
Mapped fields for Personal Address deduplication Mapped Field address1 address2 city adminarea postalcode country
Business Component Field Street Address Street Address 2 City State Postal Code Country
Table 27.
Mapped fields for Contact data cleansing Mapped Field individualid languages title gender firstname middlename lastname homephone workphone faxphone mobilephone email jobtitle
Business Component Field Id Language Code M/M M/F First Name Middle Name Last Name Home Phone # Work Phone # Fax Phone # Cellular Phone # Email Address Job Title
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
81
Configuring Data Quality with Oracle Enterprise Data Quality Oracle Enterprise Data Quality Field Mappings for Siebel Business Components
Table 27.
Mapped fields for Contact data cleansing Mapped Field taxnumber dob
Contact - DeDuplication
Table 28 lists the mapped fields for deduplication of the Contact business component.
Table 28.
Mapped fields for Contact deduplication Mapped Field individualid languages title gender firstname middlename lastname homephone workphone faxphone mobilephone email jobtitle taxnumber dob accountname
Business Component Field Id Language Code M/M M/F First Name Middle Name Last Name Home Phone # Work Phone # Fax Phone # Cellular Phone # Email Address Job Title Social Security Number Birth Date Primary Account Name
82
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Enterprise Data Quality Oracle Enterprise Data Quality Field Mappings for Siebel Business Components
Table 29 lists the mapped fields for deduplication of the Contact business component when the Multiple Child Entities parameter is disabled.
Table 29.
Mapped fields for Contact deduplication when Multiple Child Entities is disabled Mapped Field address1 address2 address3 city adminarea postalcode country
Business Component Field Primary Street Address Primary Address Street Address2 Primary Address Street Address3 Primary City Primary State Primary Postal Code Primary Country
Table 30.
Mapped fields for List Mgmt Prospective Contact data cleansing Mapped Field individualid title gender firstname middlename lastname homephone workphone faxphone mobilephone email jobtitle taxnumber dob
Business Component Field Id M/M M/F First Name Middle Name Last Name Home Phone # Work Phone # Fax Phone # Cellular Phone # Email Address Job Title Social Security Number Birth Date
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
83
Configuring Data Quality with Oracle Enterprise Data Quality Oracle Enterprise Data Quality Field Mappings for Siebel Business Components
Table 31.
Mapped fields for List Mgmt Prospective Contact deduplication Mapped Field individualid title gender firstname middlename lastname homephone workphone faxphone mobilephone email jobtitle accountname taxnumber dob
Business Component Field Id M/M M/F First Name Middle Name Last Name Home Phone # Work Phone # Fax Phone # Cellular Phone # Email Address Job Title Account Social Security Number Birth Date
Table 32 lists the mapped fields for deduplication of the List Mgmt Prospective Contact business component when the Multiple Child Entities parameter is disabled.
Table 32.
Mapped fields for List Mgmt Prospective Contact deduplication when Multiple Child Entities is disabled Mapped Field address1 address2 city adminarea
84
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Enterprise Data Quality Configuring ThirdParty Key Generation for Oracle Enterprise Data Quality
Table 32.
Mapped fields for List Mgmt Prospective Contact deduplication when Multiple Child Entities is disabled Mapped Field postalcode country
Enabling Support of Multiple Child Entity Deduplication for Oracle Enterprise Data Quality
When using third-party data quality vendors, you can enable Oracle Enterprise Data Quality to perform deduplication on multiple child entities of parent objects, such as Personal Address for the Contact object. To enable multiple child entity deduplication for Oracle Enterprise Data Quality, set the Support Multiple Child Entities Deduplication parameter to Yes as shown in Table 19. For an example of the data format, see Data Format for Deduplication of Multiple Child Entities on page 85
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
85
Configuring Data Quality with Oracle Enterprise Data Quality Data Format for Deduplication of Multiple Child Entities
The Name field is derived from the parent business component. The ListOf container is used for the Address child object. The Street field is a join field derived from the child business component.
<Data> <DriverRecord> <Name></Name> ... <ListOfAddress> <Address> <Street></Street> ... </Address1> <Address2> <Street></Street> ... </Address2> ... <AddressN> <Street></Street> ... </AddressN> </ListOfAddress> </DriverRecord> <CandidateRecord> </CandidateRecord> </Data> The following example shows the data format for the Contact object. <Data> <DriverRecord> <Contact.Id>12-12ABC1</Contact.Id> <FirstName>abc</FirstName> <LastName>abc</LastName> <ListOfAccount> <Account> <Name>A. K. Parker Inc</Name> <Location>San Mateo</Location> </Account> <Account> <Name>Hibbing Manufacturing</Name> <Location>San Jose</Location> </Account> </ListOfAccount> <ListOfPersonalAddress> <PersonalAddress> <City>York1</City> <Country>USA</Country> <Zip>78201</Zip> <State>AL</State> <Street> 1200 Davisville Road</Street> </PersonalAddress> </ListOfPersonalAddress>
86
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Enterprise Data Quality Data Format for Deduplication of Multiple Child Entities
</DriverRecord> <CandidateRecord> <Contact.Id>12-BCDD</Contact.Id> <FirstName>abc1</FirstName> <LastName>abc1</LastName> <ListOfAccount> <Account> <Name>A. K. Parker Inc</Name> <Location>San Mateo</Location> </Account> </ListOfAccount> <ListOfPersonalAddress> <PersonalAddress> <City>York1</City> <Country>USA</Country> <Zip>78201</Zip> <State>AL</State> <Street> 1200 Davisville Road</Street> </PersonalAddress> </ListOfPersonalAddress> </CandidateRecord> <CandidateRecord> <Contact.Id>12-BCEE</Contact.Id> <FirstName>abc2</FirstName> <LastName>abc2</LastName> <ListOfAccount> </ListOfAccount> <ListOfPersonalAddress> <PersonalAddress> <City>York3</City> <Country>USA</Country> <Zip>78203</Zip> <State>AL</State> <Street> 1203 Davisville Road</Street> </PersonalAddress> </ListOfPersonalAddress> </CandidateRecord> </Data>
The following example shows the data format for the Account object. <Data> <DriverRecord> <Account.Id>12-ABCC</Account.Id> <Name>Hibbing Manufacturing</Name> <Location>San Jose</Location> <ListOfCUTAddress> <CUTAddress> <City>York1</City> <Country>USA</Country> <Zip>78201</Zip> <State>AL</State>
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
87
Configuring Data Quality with Oracle Enterprise Data Quality Data Format for Deduplication of Multiple Child Entities
<Street> 1200 Davisville </CUTAddress> </ListOfCUTAddress> </DriverRecord> <CandidateRecord> <Account.Id>12-AAAA</Account.Id> <Name>Hibbing Manufacturing1</Name> <Location>San Jose</Location> <ListOfCUTAddress> <CUTAddress> <City>York1</City> <Country>USA</Country> <Zip>78201</Zip> <State>AL</State> <Street> 1200 Davisville </CUTAddress> <CUTAddress> <City>York2</City> <Country>USA</Country> <Zip>78202</Zip> <State>AL</State> <Street> 1201 Davisville </CUTAddress> </ListOfCUTAddress> </CandidateRecord> <CandidateRecord> <Account.Id>12-BBBB</Account.Id> <Name>Hibbing Manufacturing2</Name> <Location>San Francisco</Location> </CandidateRecord> <CandidateRecord> <Account.Id>12-CCCC</Account.Id> <Name>Hibbing Manufacturing3</Name> <Location>San Mateo</Location> <ListOfCUTAddress> <CUTAddress> <City>York1</City> <Country>USA</Country> <Zip>78201</Zip> <State>AL</State> <Street> 1200 Davisville </CUTAddress> </ListOfCUTAddress> </CandidateRecord> </Data>
Road</Street>
Road</Street>
Road</Street>
Road</Street>
88
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
This chapter describes the data quality configuration that you can perform for Oracle Data Quality Matching Server and Oracle Data Quality Validation Server. It covers the following topics: Process of Installing the Oracle Data Quality Matching Server on page 90 Upgrading to Informatica Identity Resolution 9.01 on page 108 Process of Configuring Oracle Data Quality Matching Server (IIR) on page 110 Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Matching Server on page 111 Configuring a New Field for Real-Time Data Matching on page 115 Incremental Data Load on page 118 Process of Configuring Data Synchronization Between Siebel and Oracle Data Quality Matching Server on page 120 Siebel Business Applications DQ Sync Action Sets on page 122 Configuring Deduplication Against Multiple Addresses on page 139 Configuring Multiple Language Support for Data Matching on page 141 Configuring Multiple Mode Support for Data Matching on page 145 Sample Configuration Files for Oracle Data Quality Matching Server on page 146 Process of Installing the Oracle Data Quality Address Validation Server on page 149 Configuring Siebel Business Applications for the Oracle Data Quality Address Validation Server on page 155 Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Address Validation Server on page 156 Sample Configuration File for Oracle Data Quality Address Validation Server on page 159 Siebel Business Applications ISSSYNC Action Sets on page 161 Activating Action Sets in Your Siebel Business Application on page 184 Sample SQL Scripts on page 184 Sample SiebelDQ.sdf File on page 197
NOTE: You must be familiar with Siebel Tools before performing some of the data quality configuration tasks. For more information about Siebel Tools, see Using Siebel Tools and Configuring Siebel Business Applications.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
89
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
1 2 3 4 5 6 7
Setting Up the Environment and the Database on page 90 Installing Oracle Data Quality Matching Server on page 92 Creating Database Users and Tables for Oracle Data Quality Matching Server on page 97 Configuring Oracle Data Quality Matching Server on page 100 Modifying Configuration Parameters for Oracle Data Quality Matching Server on page 104 Deploying Workflows for Oracle Data Quality Matching Server Integration on page 105 Initial Loading of Siebel Data into Oracle Data Quality Matching Server Tables on page 106
90
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
On UNIX, you set the PATH and CLASSPATH environment variables in the ssaset script file.
Network Protocol
Clients and Servers require a TCP/IP network connection. This includes DNS, which must be installed, configured and available (and easily contactable). The following paths (or their equivalents) must be correctly set up: /etc/hosts, /etc/resolv.conf and /etc/nsswitch.conf. Reverse name lookups must yield correct and consistent results.
ODBC Driver
The Oracle Data Quality Matching Server uses Open Database Connectivity (ODBC) to access source and target databases. ODBC Drivers for specific databases must be installed and working. Installing and configuring ODBC drivers is operating system and database dependent. Unless a driver is provided by Oracle Data Quality Matching Server (as is the case for an Oracle database), you must follow the instructions provided by your database manufacturer in order to install them. On Windows operating system, navigate to Control Panel, Administrative Tools, and then Data Sources (ODBC) to create a DSN and associate it with a driver and database server. At run time, the database layer attempts to load an appropriate ODBC driver for the type of database to be accessed. The name of the driver is determined by reading the odbc.ini file and locating a configuration block matching the database service specified in the connection string. For example, the database connection string odb:99:scott/tiger@ora920 refers to a service named ora920. A configuration block for ora920 looks similar to the following; the service name appears in square brackets: [ora920] ssadriver = ssaoci9 ssaunixdriver = ssaoci9 server = ora920.mydomain.com
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
91
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
A configuration block has the following syntax: [Service_Name] DataSourceName = ODBC_DSN ssadriver = ODBC_Driver ssaunixdriver = ODBC_UNIX_Driver server = Native_DB_Service_Name Table 33 on page 99 shows the databases supported by Oracle Data Quality Matching Server, describes the ODBC drivers required for different operating systems, and shows example odbc.ini configurations. NOTE: Oracle Data Quality Matching Server provides a custom driver for the Oracle database that is installed during the installation of the product. Oracle Data Quality Matching Server does not use the standard driver shipped with the Oracle DBMS.
NOTE: License key information for the Oracle Data Quality Matching Server is included in the product media pack on Oracle Software Delivery Cloud.
NOTE: You must install these options in the order that they are displayed.
Select Install License Server, click Next to continue, then do the following:
a b c
92
Browse to the installation directory where you want to install the License Server, then click Next. Enter the host name and port number for the License Server. Verify the installation summary details on the next screen that displays, then click Install.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
d e
When installation is complete, you are prompted to start the License Server. Click No to close the prompt, then Finish to return to the main installer window. Copy the OEM license key file downloaded from Oracle Software Delivery Cloud to the following location: <Drive:>\InformaticaIR\licenses
f g
Set the environment variable SSALI_MZXPQRS to STANISLAUS (system variable). Start the License Server by selecting, for example, the following: Programs, Informatica, Identity Resolution V2.8.07 (InformaticaIR), Informatica License Server, and then Start.
Select the Install Informatica Product from the main installer window, click Next to continue, then do the following:
a b c d e f
When prompted to specify the path to the OEM license, browse to the [installation_media_directory]\data\file1003.dat file, and then click Next to continue. Enter the host name and port number for the License Server (or accept the default), then click Next. Browse to the installation directory where you want to install Informatica Identity Resolution, then click Next. The next screen displays a list of components, click Select All, and then Next. The next screen displays an installation summary of products and modules that you want to install. Review the details and click Next to confirm that they match your requirements. Select default port values for all servers. Make sure to add XML Synchronization server at port 1671. This server is not set by default. Click Next when done.
Service Name: Enter the database service name on Informatica Identity Resolution. This is used when configuring SIEBEL instances. ODBC Data Source Name: Enter the ODBC Connect String name if using ODBC (the ODBC Data Source name is required only when connecting through ODBC). ODBC Driver: Select the applicable database driver from the drop-down list (the ODBC driver name must be provided even when ODBC is not being used). Native Service: Enter the name for the database connection as defined in dB Client\ Server utilities (for example: for Oracle an databases, this is the TNS entry name).
Example settings when using an Oracle database are: Service Name: target ODBC Data Source Name: ODBC Driver: Oracle 9 (or above) client software Native Service Name: <tns_entry> NOTE: All configuration information entered in this step is written to the odbc.ini file. Table 33 on page 99 shows some example odbc.ini configurations.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
93
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
h i 4 a
Post installation, do the following: Install the hot fix on top of the Base Installer for Informatica Identity Resolution 2.8.07. Make sure that you apply the latest Informatica Identity Resolution fix, which is available on Oracle Software Delivery Cloud. C:\InformaticaIR\bin>version SSA-NAME3 v2.8.07 (FixL106) SSA-NAME3 Extensions v2.8.07 (FixL106) Data Clustering Engine v2.8.07 (FixL106) Informatica Identity Resolution v2.8.07 (FixL106 + FixL113 + FixL114 + FixL120 + FixL123 + FixL124 + FixL125 + FixL126 + FixL127 + FixL134 + FixL136 + FixL140 + FixL141 + FixL145 + FixL147 + FixL148)
Rename xsserv.xml.org located in <drive>\InformaticaIR\bin to xsserv.xml. This file has a sample format. Change it to match the following: <server xmlns="_http://www.identitysystems.com/xmlschema/iss-version-1/ xmlserv"> <mode>generic</mode> <rulebase>odb:0:db_username/db_password@ISS_connectstring</rulebase> </server>
NOTE: If you do not make these changes to xsserv.xml, then errors might occur using legacy SIEBEL-ISS Sync workflows.
a b c d e
Obtain an installable ISO from Informatica or Oracle Software Delivery Cloud. Make sure to mount the ISO to a suitable location which is accessible to the user who is installing Informatica products. Ensure that the required ODBC entries (if applicable) are created. Ensure to add TNS entries pointing to the Informatica Identity Resolution Database (target database) for system configurations. The Informatica Installer requires a UNIX GUI for installation and product administration. If necessary, contact your system administrator to make sure that the corresponding applications and X-Server have been enabled for easy access.
94
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
After ensuring the installer location and GUI access, start the installer from the mounted location using the following command: ./install The Informatica Installer window opens with three options. You must install the three options in the order that they are displayed.
Select the Install License Server from the installer, click Next to continue, then do the following:
a b c d
Select the path where you want to install the license server, then click Next. Enter the port number for the License Server on the next screen that displays. You can accept the default (if available), or choose to change the port. Click Next when done. Verify the installation summary details on the next screen that displays, then click Install. When installation is complete, you are prompted to start the License Server. Click No, and then Finish to return to the main installer window. You must start the License Server only when the license file is available. Copy the OEM license key file downloaded from Oracle Software Delivery Cloud to the following location: <Drive:>/InformaticaIR/licenses
f g
Export the environment variable SSALI_MZXPQRS to STANISLAUS (system variable) before proceeding to the next step. Start the License Server:
Start an xterm / ssh session. Change to bash (Bourne Shell) Copy the license file to <installation_folder>/licenses
Set common environment variables by sourcing idsset script located at <IIR_Installation_Folder>/env. For example: . ./idsset
Set the environment variables required to start the License Server by sourcing script lienvs located at <IIR_Installation_Folder>/env. For example: . ./lienvs
j 4
Select Install Informatica Product from the main installer window, click Next to continue, then do the following:
a b c d
When prompted to specify the path to the OEM license, browse to the [installation_media_directory]/data/file1003.dat file, and then click Next to continue. Enter the License Server port number or accept the default, then click Next. The next screen displays a list of components. Licensed components have an editable check-box. Select the check box beside the required components and populations, and then click Next. The next screen displays a summary of selected options. Verify the details, then click Next.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
95
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
e f
On the next screen, select or set servers and their ports, then click Next. If a port is already in use, you must change it. On the next screen, enter database information:
Service Name: Enter the database service name on Informatica Identity Resolution (this is used when configuring SIEBEL instances). ODBC Data Source Name: Enter the ODBC Connect String name if using ODBC (the ODBC Data Source name is required only when connecting through ODBC). ODBC Driver: Select the applicable database driver from the drop-down list (the ODBC driver name must be provided even when ODBC is not being used). Native Service: Enter the name for the database connection as defined in dB Client/ Server utilities (for example: for Oracle an databases, this is the TNS entry name).
Example settings when using an Oracle database are: Service Name: target ODBC Data Source Name: ODBC Driver: Oracle 9 (or above) client software Native Service Name: <tns_entry> Click Next to continue. NOTE: All configuration information entered in this step is written to the odbc.ini file. Table 33 on page 99 shows some example odbc.ini configurations.
g h i 5
The next screen displays an installation summary of products and modules that you want to install. Verify the details and confirm that they match your requirements. Click Install to start the installation. Click Finish to complete.
Post installation, rename xsserv.xml.ori located in <IIR_Installation_Folder>/bin to xsserv.xml. Change the contents of this sample file as follows:
a b
For example: <server xmlns="_http://www.identitysystems.com/xmlschema/iss-version-1/xmlserv"> <mode>generic</mode> <rulebase>odb:0:db_username/db_password@ISS_connectstring</rulebase> </server> NOTE: If you do not make these changes to xsserv.xml, then errors might occur using legacy SIEBEL-ISS Sync workflows.
96
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
Creating Database Users and Tables for Oracle Data Quality Matching Server
Creating database users and tables involves executing a number of scripts on the Oracle Data Quality Matching Server database. The scripts that you must execute are located in the Oracle Data Quality Applications media pack on Oracle Software Delivery Cloud. Once the media pack is downloaded and installed, the scripts are located in the InformaticaIR/dbscript/ora folder. For example (idsuseru.sql): C:/InformaticaIR/dbscript/ora/idsuseru.sql You must open these scripts and modify them as required, depending on the database that you are using. For example, complete the steps in the following procedure to create database users and database tables for Oracle Data Quality Matching Server if using an Oracle database. Note the following: The procedure is similar if using Microsoft SQL Server, UDB, or DB2 on OS/390. However, you must modify the SQL scripts according to the database that you are using. The procedure is also similar whether creating database users and database tables for Oracle Data Quality Matching Server on Microsoft Windows or on UNIX. When setting up the database for Oracle Data Quality Matching Server on UNIX, you must set TNSNAmes.ora with an entry to the target database (Oracle Data Quality Matching Server database), and perform connectivity testing using SQLPLUS if required.
For more information about testing the connectivity on UNIX, see the relevant documentation included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud. This task is a step in Process of Installing the Oracle Data Quality Matching Server on page 90.
To create database users and tables for Oracle Data Quality Matching Server if using an Oracle database 1
Log in to the database as database administrator, then execute the idsuseru.sql script to create a new database user with appropriate privileges to create and update Oracle Data Quality Matching Server tables. NOTE: You must be logged in as database administrator to execute idsuseru.sql.
Log in to the database as the new database user (created in Step 1 with appropriate privileges to create and update Oracle Data Quality Matching Server tables), then execute the following SQL scripts to create other Oracle Data Quality Matching Server database tables, such as IDT and IDX tables. You can execute the following SQL scripts in any order: NOTE: IDT tables store the copy of source records in the Oracle Data Quality Matching Server database. IDX tables store the index keys for IDT tables. Each IDT table can have one or more IDX tables associated with it.
a b
Execute idstbora.sql to create control tables for the Oracle Data Quality Matching Server. Execute updsyncu.sql to create database objects required by the Oracle Data Quality Matching Server to synchronize data in ID tables with updates to user source tables.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
97
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
Run this script on all databases containing user source tables that require synchronization, and also before loading any ID tables that require synchronization.
Execute updsynci.sql to create database objects required by the Oracle Data Quality Matching Server to synchronize data in ID tables with updates to user source tables. Run this script on the database which will contain IDTs, and also before loading any ID tables that require synchronization.
Execute updsyncg.sql to create database objects required by the Oracle Data Quality Matching Server to synchronize data in SSA-ID tables with updates to user source tables. This script will create public synonyms for the Oracle Data Quality Matching Server objects created on user source table databases. This script must be run by someone (for example, the database administrator) who has the privilege to CREATE PUBLIC SYNONYM. Run this script after running updsyncu.sql. Use the same userid to run updsynci.sql as you did to run updsyncu.sql. NOTE: You must be logged in as database administrator to execute updsyncg.sql.
98
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
Table 33.
Example odbc.ini Configuration Blocks Description The Oracle database driver works out-of-the box and is named %SSABIN%\ssaoci{8|9}.dll on Windows, and $SSABIN/ libssaoci{8|9}.s{o|l} on UNIX. There are no special setup requirements, other than adding configuration blocks to your odbc.ini file. The ODBC_Driver name can be either ssaoci8 or ssaoci9. The former must be used with Oracle 8 client libraries and does not support Unicode data. The latter can be used with Oracle 9 (or later) client libraries and supports Unicode access. When using the ssaoci9 driver with Oracle Database 10g client software, the connectivity test might fail on some UNIX operating systems. This occurs because the driver has been linked with libclntsh.so.9.0, which is not distributed with Oracle Database 10g. Oracle normally provides backward compatibility by adding symbolic links to redirect requests for older versions of the library to the current version. Unfortunately, by default, this practice is restricted to minor versions only (for example, 9.0-9.2). To overcome the problem, locate the appropriate Oracle lib directory (lib, lib32, or lib64) and add a symbolic link. For example: cd $ORACLE_HOME/lib32 ln -s ./libclntsh.so libclntsh.so.9.0 Example odbc.ini Configurations [ora10g] ssadriver = ssaoci9 ssaunixdriver = ssaoci9 server = ora10g.mynet8tns.name
UDB must be installed prior to the installation of Oracle Data Quality Matching Server. IBM provides ODBC drivers for both Windows and UNIX operating systems, named db2cli and db2 respectively. For more information about the db2cli and db2 drivers, see the appropriate UDB manuals for full details.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
99
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
Table 33.
Example odbc.ini Configuration Blocks Description Microsoft provides a Windows ODBC driver named sqlsrv32. It is configured by adding a new Data Source Name (DSN) using Control Panel, Administrative Tools, Data Sources (ODBC). For more information about the sqlsrv32 driver, see the appropriate Microsoft manuals for specific details. The ODBC_Driver name is sqlsrv32 and the Native_DB_Service is the server name (-S parameter of the osql and bcp utilities). The SQL Server Native Client (sqlncli.dll) can be used as an alternative driver. Example odbc.ini Configurations [production] DataSourceName = msq2003 ssadriver = sqlsrv32 server = mySQLServer
Sybase
For more information about the sybdrvodb drivers, see the appropriate Sybase manuals for installation specifics.
Testing Connectivity
Use the dblist utility to test your ODBC configuration by connecting to a database whose connection string is provided with the -d parameter. An example of the output associated with a successful connection follows: $SSABIN/dblist -c -dodb:99:ssa09/SSA09@ora920 Maximum connections per module: 1024 Linked databases: odb: sdb: Driver Manager: 'Identity Systems ODBC Driver Manager 1.2.2.3' ODBC Driver: 'ssaoci9 SSADB8 2.7.0.00MSVC60 Jun 8 2006 17:26:56' DBMS Name: 'Oracle DBMS (9.2.0.6.0)' Native DB type: 'ora'
This task is a step in Process of Installing the Oracle Data Quality Matching Server on page 90.
100
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
To configure Oracle Data Quality Matching Server for data matching on Microsoft Windows 1
If required, modify the odbc.ini file located at <drive>:\<IIR_Installation_Folder>\InformaticaIR\bin\ to contain the ODBC connection string of your target database, for example, as follows: [Target] ssadriver=ssaoci9 server=qa19b_sdchs20n519 Table 33 on page 99 describes the ODBC drivers required for different operating systems. NOTE: For an Oracle database, the server parameter specifies a connect string from the tnsnames.ora file (which is the network configuration file of the Oracle database client). For other databases, the server contains the ODBC datasource name (DSN). The database information that you enter when installing Oracle Data Quality Matching Server is reflected in the odbc.ini file. If all values are correct and you do not want to make any changes to the database information, then you can skip this step.
Copy the SiebelDQ.sdf file to the following (IIR server) folder location: <Drive>:\<IIR_Installation_Folder>\InformaticaIR\ids NOTE: For an example SDF file, see Sample SiebelDQ.sdf File on page 197.
To use the XML Sync Server instead of the External Business Components for Informatica Identity Resolution, then activate or deactivate the following ports located in <Drive>:\<IIR_Installation_Folder>\env\isss.bat. ::set SSA_XSPORT=1671 ::set SSA_XSHOST=localhost:1671 Removing the double colon from the beginning of the line activates the process listening on the ports: set SSA_XSHOST=localhost:1671 set SSA_XSPORT=1671 NOTE: For Informatica Identity Resolution Version 2.7, you turn on the XML Sync Server by modifying the idsenvs.bat file located in <Drive>:\<ISS Installation Folder>\iss2704s\bin.
Create a tmp folder for the IIR Synchronizer Workflow Log in <Drive>:\<IIR_Installation_Folder>\InformaticaIR\. For example: C:\InformaticIR\tmp
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
10 1
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
NOTE: If you install Oracle Data Quality Matching Server on a different drive (other than C:\), you must modify the ISSErrorHandler workflow in your Siebel application to specify the correct log folder. Other modifications that must be made if you install Oracle Data Quality Matching Server on a drive other than C:\ include modifying action sets and the location where you deploy the XML files.
Start the IIR Server by navigating to, for example, the following: Programs, Informatica, Identity Resolution V2.8.07 (InformaticaIR), Informatica Identity Resolution, Informatica IR Server - Start(Configure Mode) NOTE: You can also start the Informatica Identity Resolution server from the command prompt using the idsup command.
Start the IIR Console Client (in Admin Mode) by navigating to, for example, the following: Programs, Informatica, Identity Resolution V2.8.07 (InformaticaIR), Informatica Identity Resolution, Informatica IR Console Client - Start(Admin Mode)
Create a new system in IIR using SiebelDQ.sdf. The system that you create in IIR (Console Client, Admin Mode) will hold all the IDT and IDX database tables. For more information about creating a new system in IIR, see the relevant documentation included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud.
When the system is created (initially, it will be empty), run LoadIDT from the IIR Console Client. For more information, see Initial Loading of Siebel Data into Oracle Data Quality Matching Server Tables on page 106.
To configure Oracle Data Quality Matching Server for data matching on UNIX 1
Copy the most recent version of the shared library libssaiok.so (libssaiok.sl on HP-UX) to the SSA-NAME3 bin directory. If the version packaged with IIR is more recent than the one packaged with SSA-NAME3, copy the ssaiok shared library from the IIR server distribution to the SSA-NAME3 bin directory as follows: cp $SSATOP/common/bin/libssaiok.* $SSAN3V2TOP/bin No action is required if the version packaged with IIR is older than the one packaged with SSANAME3.
Set the shared library path according to your operating system. The following table shows examples of shared library paths.
102
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
If required, modify the odbc.ini file to contain the ODBC connection string of your target database:
a b
Copy the odbc.ini.ori file located in the $SSATOP/bin folder, and rename it odbc.ini. Edit the odbc.ini to contain the ODBC connection string of your target database, for example, as follows: [Target] ssaunixdriver=ssaoci9 server=<TNS_entry_name_from_tnsnames.ora> For an Oracle database, the server parameter specifies a connect string from the tnsnames.ora file (which is the network configuration file of the Oracle database client). For other databases, the server contains the ODBC datasource name (DSN). Most UNIX installations do not need the ODBC DSN, but if required, parameters change accordingly: [Target] DataSourceName=ODBC_DNS_Name_Pointing_to_ISS_DB ssaunixdriver=<ssaoci9> Table 33 on page 99 describes the ODBC drivers required for different operating systems.
The database information that you enter when installing Oracle Data Quality Matching Server is reflected in the odbc.ini file. If all values are correct and you do not want to make any changes to the database information, then you can skip this step.
Copy the System Definition File (SDF) to the UNIX server. Make sure that the SDF file is compressed before using FTP to copy it to the UNIX server. You must use the -a switch to extract a file on a UNIX server, for example, as follows: unzip - sysdeffile.zip
For more information about configuring ODBC on UNIX, see the relevant documentation included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
10 3
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
To modify configuration parameters for the Oracle Data Quality Matching Server 1 2
Open up a text editor. Modify the following parameters in ssadq_cfg.xml, as required:
a b c
Set <iss_host> to point to the server where Oracle Data Quality Matching Server is running. Set <iss_port> to 1666 (which is the default), unless you are using a different port for installation. Set the <rulebase_name> parameter. For example, with Oracle Database 10g:
username is ssa password is SIEBEL ServiceName is Target (as specified in the odbc.ini file for the Oracle Data Quality Matching Server server) <rulebase_name> Example: odb:0:ssa/SIEBEL@Target
For more information about the format of the rulebase name, see the relevant documentation included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud.
Set <contact_system>, <account_system>, and <prospect_system> to the name of the system that you create in Oracle Data Quality Matching Server (IIR) using the SiebelDQ.sdf file. The system that you create in IIR (Console Client, Admin Mode) will hold all the IDT and IDX database tables. For more information about creating a new system in IIR, see the relevant documentation included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud. If you want to run Oracle Data Quality Matching Server against only a single entity (for example, Accounts) as opposed to multiple entities (Accounts, Contacts, and Prospects), then you must alter the definitions within the SiebelDQ.sdf file to include only the one entity that you want as otherwise the synchronizer fails to run. In this example, you must remove the definitions for Contacts and Prospects. Any changes that you make to the SDF file must be appended to the user property for the business service DQ Sync Services. If you do not want to use a particular field (for example, Birth Date) as part of deduplication, then that field must be removed from the SDF file. In addition, you must do the following:
Remove the corresponding mapping from data quality third-party administration settings in your Siebel application.
104
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
Change the user property in the DQ Sync Services business process. For example: For Account, change the Account_DeDupFlds user property. For Contact, change the Contact_DeDupFlds user property.
Remove the DeDup field from the user property. Remove the corresponding mapping in the user property for external length. This is Account_ExtLen for Account, and Contact_ExtLen for Contact.
Since CUT Address is shared across Account and Contact, any change in the CUT Address is reflected in both Account and Contact de-duplication.
Save the ssadq_cfg.xml file and copy to the SDQConnector folder on Siebel Server for changes to take effect: siebsrvr/SDQConnector
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
10 5
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
These workflows are used to synchronize data between the Siebel application and Oracle Data Quality Matching Server (IIR) tables. For more information about Siebel Workflow, see Siebel Business Process Framework: Workflow Guide. NOTE: The activation or deactivation of these workflows depends on your business needs. The business service, DQ Sync Services, can be used if you are using the multiple address feature.
Initial Loading of Siebel Data into Oracle Data Quality Matching Server Tables
To initially load your Siebel application data into Oracle Data Quality Matching Server (IIR) tables, complete the steps in the following procedure. This procedure uses SQL scripts and is for large implementations where, for example, the database is too large to use an XML file import or export to initially load Siebel application data into Oracle Data Quality Matching Server tables. CAUTION: Before proceeding any further, you must read, understand, and follow the following guidelines: It is highly recommended that data is directly loaded from source tables into Oracle Data Quality Matching Server tables. The sample system definition file (SiebelDQ.sdf) includes appropriate sections to load data directly from source tables into Oracle Data Quality Matching Server tables. NOTE: For an example SDF file, see Sample SiebelDQ.sdf File on page 197. The system definition file includes information about the matching criteria for various entities. As part of the initial analysis, it is essential that you review the sample system definition file (SiebelDQ.sdf) and make appropriate changes to it, before creating any new systems in IIR. The sample system definition file (SiebelDQ.sdf) is not a preconfigured configuration file; it serves as a sample for you to start with. Make sure that the entries in the system definition file are in sync with the data quality configuration settings that you set up in your Siebel application (in Administration - Data Quality screen, Third Party Administration view). Make sure that the user properties that you set up in Siebel Tools for the business service are in sync with the entries in your system definition file.
NOTE: If you encounter errors when trying to initially load a high volume of data (greater than 10,000 records), then set the system environment variable SSAOCI_IGNORE_UCS2_BYTES to one, and restart the Oracle Data Quality Matching Server server and client. Also, adding zeros when setting the SSA_XML_SIZE parameter can help when initially loading large files. For example: set SSA_XML_SIZE to 8000000. This task is a step in Process of Installing the Oracle Data Quality Matching Server on page 90.
106
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Matching Server
To initially load Siebel application data into Oracle Data Quality Matching Server tables 1
Start the IIR Server by navigating to, for example, the following: Programs, Informatica, Identity Resolution, v2.8.07 (InformaticaIR), Informatica Identity Resolution, Informatica IR Server - Start(Configure Mode)
Start the IIR Console Client (in Admin Mode) by navigating to: Programs, Informatica, Identity Resolution, v2.8.07(InformaticaIR), Informatica Identity Resolution, Informatica IR Console Client - Start(Configure Mode)
If not already done so, create a new system in IIR using the appropriate System Definition file that you have reviewed and modified using the sample SiebelDQ.sdf file as a starting point. Or, if a system already exists, select it and refresh it by clicking the System/Refresh button. The system that you create in IIR (Console Client, Admin Mode) will hold all the IDT and IDX database tables. For more information about creating a new system in IIR, see the relevant documentation included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud. NOTE: If you want to run IIR against only a single entity (for example, Accounts) as opposed to multiple entities (Accounts, Contacts, and Prospects), then you must alter the definitions within the SiebelDQ.sdf file to include only the one entity that you want as otherwise the synchronizer fails to run. In this example, you must remove the definitions for Contacts and Prospects.
Run the IDS_IDT_<ENTITY TO BE LOADED>_STG.sql script to take a snapshot of records in the Siebel application. For example, for account initial load, execute the following script from the SQL prompt as user SSA_SRC: IDS_IDT_ACCOUNT_STG.sql Depending on project requirements, IIR configuration, and data quality configuration, you must modify sample scripts provided with the software accordingly. NOTE: It is not mandatory to always load the data incrementally. If the initial volume of data to load is not high, then you can load the data directly from source tables to IIR tables in one go.
Run the IDS_IDT_CURRENT_BATCH_<ENTITY TO BE LOADED>.sql script to create the dynamic view to load the snapshot created in Step 4. For example, for account initial load, execute the following script from the SQL prompt as user SSA_SRC: IDS_IDT_CURRENT_BATCH_ACCOUNTS.sql To be in sync with the snapshot created in Step 4 and the SDF file used for system creation in Step 3, you must modify the sample scripts provided with the software according to project requirements, IIR configuration, and data quality configuration. Also, use a batch size that is appropriate to your project needs, initial data load volume, and any other project specific needs.
Run the following SQL script to create the database table to store the current batch number being loaded: IDS_IDT_CURRENT_BATCH.sql
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
10 7
Configuring Data Quality with Oracle Data Quality Servers Upgrading to Informatica Identity Resolution 9.01
Load IIR with data from the Siebel application by clicking the System/Load IDT button. Make sure to select the All_load option from the Loader Definition menu in the dialog that displays. This process loads records with batch number 1 from the snapshot created earlier. Validate the data to make sure that all the records with batch number 1 are correctly loaded.
8 9
Open a command window and navigate to the directory where the initial load scripts were copied during product installation. Execute the initial load process by entering the following command at the command line: IDS_IDT_LOAD_ANY_ENTITY.CMD <Entity> <System> <Work Directory> For example, for account initial load, execute the following script: IDS_IDT_LOAD_ANY_ENTITY.CMD Account c:/initialLoad/logs This loads data in batches from the snapshot created in Step 4. The log files and error files recording the outcome of each batch load are stored in the C:/InitialLoad/logs directory.
10 Examine the log files and error files to identify any batch that failed to load. Use the information
in the log and error files to determine the root cause for any failure and fix the underlying issue.
11 Incrementally load the failed batches individually using the following script from the command
line: IDS_IDT_LOADBATCH_ANY_ENTITY.CMD For example, to load batch 33 of account, execute the following script from the command line: IDS_IDT_LOADBATCH_ANY_ENTITY.CMD Account SiebelDQ c:/initialLoad/logs 33
12 Examine the log files and error files to ensure that the (failed) batches successfully loaded. In
case of errors, use the information in the log and error files to determine the root cause for the failure and fix the underlying issue. Repeat Step 11 until all the batches have successfully loaded.
13 Repeat this process to load other entities such as contacts and prospects.
108
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Upgrading to Informatica Identity Resolution 9.01
Add the following for Account: User Property Account Record Length Account_Business Address_DataType Account_Business Address_DeDupFlds Account_Business Address_ExtLen Value 1092 W|W|W|W|W|C|C City|Country|Postal Code|State|Street Address|Address Id|CL ID 200|120|120|40|200|60|2
Add the following for Contact: User Property Contact Record Length Contact_INS Personal Address_DataType Contact_INS Personal Address_DeDupFld Value 1852 W|W|W|W|W|C|C INS Personal City|INS Personal Country|INS Personal Postal Code|INS Personal State|INS Personal Street Address|INS Personal Address Id|CL ID 120|40|40|40|200|60|2
Add the following for Prospect: User Property Prospect Record Length Prospect_DataType Prospect_DeDupFlds Value 1772 W|W|W|W|W|W|W|W|W|W|W|W|W|C|C Account|Cellular Phone #|City|Country|Email Address|First Name Last Name|Home Phone #|Middle Name|Postal Code|Social Security Number|State|Street Address|Work Phone #|Id|CL ID 200|120|120|60|120|200|120|200|40|120|40|200|200|30|2
Prospect_ExtLen
Apply the Informatica Address Doctor Version 5 license in the ssaasmv5.xml file as follows:
a b
Open the ssaasmv5.xml file which is located in the InformaticaIR/ssaas/ad5 folder. Configure the Informatica Address Doctor Version 5 license as follows:
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
10 9
Configuring Data Quality with Oracle Data Quality Servers Process of Configuring Oracle Data Quality Matching Server (IIR)
Go to the <ASM_ADv5_Config> tag. Set the unlock_code value as follows: <ASM_ADv5_Config> <MAX_THREAD>1</MAX_THREAD> <MAX_ADOBJECTS>1</MAX_ADOBJECTS> <AD5_UNLOCK_CODE> <UNLOCK_CODE>unlock_code</UNLOCK_CODE> </AD5_UNLOCK_CODE> </ASM_ADv5_Config>
NOTE: If Informatica Identity Resolution 9.01 is being used on UNIX, the ssaasmv5.xml file has the following blank tag under <ASM_ADv5_CONFIG>, which must either be set with a proper value or completely removed from the ssaasmv5.xml file: <ENRICHMENT_OPTION> </ENRICHMENT_OPTION>
110
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Matching Server
Change the DeDuplication Data Type setting to ISS on all object managers as described in Enabling Data Quality at the Enterprise Level on page 35. This parameter can be set at the Enterprise, Siebel Server, or component level. For example, srvrmgr commands similar to the following can be used to set the parameters: Change Change Change Change param param param param DeDupTypeType =ISS DeDupTypeEnable =True, DeDupTypeType =ISS for comp DQMgr DeDupTypeEnable =True, DeDupTypeType =ISS for comp SCCObjMgr_enu DeDupTypeEnable =True, DeDupTypeType =ISS for comp UCMObjMgr_enu
NOTE: You must change the DeDuplication Data Type setting to ISS on all object managers for deduplication with Oracle Data Quality Matching Server to be active.
Set data quality settings as described in Specifying Data Quality Settings on page 37. Make sure that the following parameters are set to Yes: Enable DeDuplication Force User DeDupe - Account Force User DeDupe - Contact Force User DeDupe - List Mgmt
Verify that the preconfigured vendor parameter and field mapping values are set up as listed in Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Matching Server on page 111. Modify the ssadq_cfg.xml file as described in Modifying Configuration Parameters for Oracle Data Quality Matching Server on page 104
For more information about Siebel Server configuration and management, see Siebel System Administration Guide.
Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Matching Server
This topic includes information about the Universal Connector parameter and field mapping values for the Oracle Data Quality Matching Server: Preconfigured Vendor Parameters for Oracle Data Quality Matching Server on page 112 Preconfigured Field Mappings for Oracle Data Quality Matching Server on page 112
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
11 1
Configuring Data Quality with Oracle Data Quality Servers Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Matching Server
112
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Matching Server
Table 35.
Preconfigured Oracle Data Quality Matching Server Field Mappings for Account Mapped Field PAccountCity PAccountCountry PAccountPostalCode PAccountState PAccountStrAddress RowId
Business Component Field Primary Account City Primary Account Country Primary Account Postal Code Primary Account State Primary Account Street Address Row Id
Business Component Field Birth Date Cellular Phone # Email Address First Name Last Name Home Phone # Middle Name Primary Account Name Primary Personal City Primary Personal Country Primary Personal Postal Code Primary Personal State Primary Personal Street Address Row Id Social Security Number Work Phone #
Preconfigured Field Mappings for Business Component - List Mgmt Prospective Contact
Table 37 shows the Oracle Data Quality Matching Server data matching field mappings for the List Mgmt Prospective Contact business component and DeDuplication operation.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
11 3
Configuring Data Quality with Oracle Data Quality Servers Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Matching Server
Table 37.
Preconfigured Oracle Data Quality Matching Server Field Mappings for List Mgmt Prospective Contact Mapped Field Account CellularPhone City Country EmailAddress NAME HomePhone RowId MiddleName PostalCode SocialSecurityNumber State StreetAddress WorkPhone
Business Component Field Account Cellular Phone # City Country Email Address First Name Last Name Home Phone # Id Middle Name Postal Code Social Security Number State Street Address Work Phone
Business Component Field City Country Postal Code Row Id State Street Address
114
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Configuring a New Field for Real-Time Data Matching
Select the Vendor shown in the following table: Vendor Name ISS DLL Name ssadqsea
Select the business component operation shown in the following table: Business Component Name Contact Operation DeDuplication
Add the field mapping shown in the following table: Business Component Field Position Mapped Field MyPosition
If using the old synchronizer, modify the Identity Search Server synchronization Integration Object by adding the new fields to it. In this example, you must modify the ISS_Contact to add the new Integration Component Field as shown in the following table: Name Position Data Type DTYPE_TEXT Length 50 External Name Position XML Tag MyPosition
NOTE: For a Contact, you must modify the ISS_Contact integration object. For an Account, you must modify the ISS_Account integration object. For a Prospect, you must modify the ISS_List_Mgmt_Prospective_Contact integration object.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
11 5
Configuring Data Quality with Oracle Data Quality Servers Configuring a New Field for Real-Time Data Matching
Add the new field to the Synchronize Integration Object. In this case, 'SyncContact' IO. For the contact address field, add it to the 'Contact_INS Personal Address' Integration Component. Add the new field to the 'DQ Sync Services' user property. In this case, 'Contact_DeDupFlds'. For the contact address field, add it to the 'Contact_INS Personal Address_DeDupFlds' user property. Add the new field length to one or both of the following 'DQ Sync Services' user properties: 'Contact_ExtLen' 'Contact_INS Personal Address_ExtLen' NOTE: It is mandatory that you maintain the same sequence that is detailed in the sdf file. Also make sure that the address fields are grouped together at the end of the sdf file.
Enter the new record length into the 'Contact Record Length' user property. This user property holds a total of all the field lengths in 'Contact_ExtLen' and 'Contact_INS Personal Address_ExtLen'.
NOTE: ISSDataSrc must be added to the OM - Named Data Source component parameter for the UCM object manager and EAI object manager components.
Add the new fields to the IDT table in IIR: In the following example for the old synchronizer, you must add the new Position field to the IDT_Contact. For example: create_idt IDT_CONTACT SOURCED_FROM FLAT_FILE BirthDate CellularPhone EmailAddress NAME HomePhone MiddleName Account City Country PrimaryPostalCode State StreetAddress RowId SocialSecurityNumber MyPosition WorkPhone SYNC_REPLACE_DUPLICATE_PK TXN-SOURCE NSA ;
w(60), w(60), w(60), w(100), w(60), w(100), w(100), w(60), w(20), w(20), w(20), w(100), w(30) w(60) w(60) w(60)
116
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Configuring a New Field for Real-Time Data Matching
In the following example for the new synchronizer, you must add the new Position field to the IDT_Contact. create_idt IDT_CONTACT sourced_from odb:15:ssa_src/ssa_src@ISS_DSN INIT_LOAD_ALL_CONTACTS.BIRTHDATE INIT_LOAD_ALL_CONTACTS.CELLULARPHONE INIT_LOAD_ALL_CONTACTS.EMAILADDRESS INIT_LOAD_ALL_CONTACTS.NAME INIT_LOAD_ALL_CONTACTS.HOMEPHONE INIT_LOAD_ALL_CONTACTS.MIDDLE_NAME INIT_LOAD_ALL_CONTACTS.ACCOUNT INIT_LOAD_ALL_CONTACTS.CONTACT_ID (pk1) INIT_LOAD_ALL_CONTACTS.SOCIALSECURITYNUMBER INIT_LOAD_ALL_CONTACTS.MYPOSITION INIT_LOAD_ALL_CONTACTS.WORKPHONE INIT_LOAD_ALL_CONTACTS.CITY INIT_LOAD_ALL_CONTACTS.COUNTRY INIT_LOAD_ALL_CONTACTS.POSTAL_CODE INIT_LOAD_ALL_CONTACTS.STATE INIT_LOAD_ALL_CONTACTS.STREETADDRESS INIT_LOAD_ALL_CONTACTS.ADDRESS_ID (pk2) SYNC REPLACE_DUPLICATE_PK TXN-SOURCE NSA ;
BirthDate CellularPhone EmailAddress NAME HomePhone MiddleName Account RowId SocialSecurityNumber MyPosition WorkPhone City Country PrimaryPostalCode State StreetAddress ContactAddressID
V(60), V(60), V(60), V(100), V(60), V(100), V(100), C(30), V(60), V(60), V(60), V(60), V(20), V(20), V(20), V(100), C(60)
Modify SCORE-LOGIC in the IIR search definition: A set of field types are provided that are supported by Match Purpose. For Contact Match Purpose, the required and optional field types are shown in the following table: Field Person_Name Organization_Name Address_Part1 Address_Part2 Posal_Area Telephone_Number ID Date Attribute1 Attribute2 Required Yes Yes Yes No No No No No No No
If you want the new field to contribute to the match score, add it to the SCORE-LOGIC section in IIR search definition. For example:
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
11 7
Configuring Data Quality with Oracle Data Quality Servers Incremental Data Load
search-definition ================== NAME= "search-person-name" IDX= IDX_CONTACT_NAME COMMENT= "Use this to search and score on person" KEY-LOGIC= SSA, System(default), Population(usa), Controls("FIELD=Person_Name SEARCH_LEVEL=Typical UNICODE_ENCODING=6", Field(Name) SCORE-LOGIC= SSA, System(default), Population(usa), Controls("Purpose=Person_Name MATCH_LEVEL=Typical UNICODE_ENCODING=6", Matching-Fields ("Name:Person_Name,StreetAddress:Address_Part1,City:Address_part2,State:Attr ibute1,PrimaryPostalCode:Postal_area,MyPosition:Attribute2")
Delete the existing system in IIR, and then create a new system using the new SiebelDQ.sdf file. For more information about creating a new system in IIR (which will hold all the IDT and IDX database tables), see the relevant documentation included in Siebel Business Applications ThirdParty Bookshelf in the product media pack on Oracle Software Delivery Cloud.
Reload the IIR system as described in Initial Loading of Siebel Data into Oracle Data Quality Matching Server Tables on page 106
To incrementally load Siebel application data into Informatica Identity Resolution tables 1 2 3 4
Start the Informatica Identity Resolution server in Configure Mode. Start the Informatica Identity Resolution Console client in Admin Mode. Select System (if already created), otherwise create a new system using the required sdf file. Run the IDS_IDT_<ENTITY TO BE LOADED>_STG.sql script to take a snapshot of records in the Siebel application. For example, execute the following script from the SQL command line prompt for account initial load using your appropriate login credentials: IDS_IDT_ACCOUNT_STG.sql
118
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Incremental Data Load
The following sample SQL scripts can be used to capture snapshots of the data:
For more information about these example scripts, see Sample SQL Scripts on page 184.
While creating a snapshot using the example scripts listed in the previous step, users are prompted to enter a batch size. Depending on the value entered, the entire snapshot is grouped into batches of the specified batch size. For example, run the following SQL script to create the database table to store the current batch number being loaded (this value is usually 1 for the first time): IDS_IDT_CURRENT_BATCH.sql
Run the IDS_IDT_CURRENT_BATCH_<ENTITY TO BE LOADED>.sql script to create the dynamic view to load the snapshot for the staging table created in the previous step. For example, execute the following script from the command line: IDS_IDT_CURRENT_BATCH_ACCOUNT.sql The following sample SQL scripts can be used to create the views to process the records in a given batch:
For more information about these example scripts, see Sample SQL Scripts on page 184.
Open the Informatica Identity Resolution client and perform a Load IDT. Load the remaining batches of data through the ISS batch Utility. Open a command window and navigate to the directory where the initial scripts for loading have been copied. Execute the initial load process by entering the following command at the command line: IDS_IDT_LOAD_ANY_ENTITY.CMD <Entity> <Work Directory> For example, execute the following script from the command line for account initial load: IDS_IDT_LOAD_ANY_ENTITY.CMD Account c:/initialLoad/logs
The following files contain the parameters used by the batch load utility; you must update these files to reflect your installation:
NOTE: Certain SQL and shell scripts are required to create materialized views and to load data incrementally. Depending on customer requirements, you can fine tune these sample files during implementation.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
11 9
Configuring Data Quality with Oracle Data Quality Servers Process of Configuring Data Synchronization Between Siebel and Oracle Data Quality Matching Server
Incrementally load the failed batches individually using the following script from the command line: IDS_IDT_LOADBATCH_ANY_ENTITY <Entity> <Work Directory> <Batch Number> For example, execute the following script from the command line to load batch 33 of account: IDS_IDT_LOADBATCH_ANY_ENTITY.CMD Account c:/initialLoad/logs 33 Examine the log files and error files to ensure that all batches have successfully loaded. In the case of errors, use the information in the log and error files to determine the root cause for the failure and fix the underlying issue; repeat the load process as necessary.
Process of Configuring Data Synchronization Between Siebel and Oracle Data Quality Matching Server
When configuring data synchronization between the Siebel application and Oracle Data Quality Matching Server, perform the following tasks:
1 2 3
Configure the data source, see Configuring the Data Source on page 120. Configure the DQ Sync Services Business Service to Use Informatica Identity Resolution 9.01, see Upgrading to Informatica Identity Resolution 9.01 on page 108 (Step 1). Configuring Object Managers for ISS Data Source on page 121.
DSConnectString: This is the data source connect string. For the Microsoft SQL Server or the IBM DB2 databases, create an ODBC or equivalent connection and input the name of the connection in the parameter. For an Oracle RDBMS, specify the TNS name associated with the database, and not an ODBC or other entry.
120
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Configuring Data Synchronization Between Siebel and Oracle Data Quality Matching Server
DSSSQLStyle: This is the database SQL type. DSDLLName: This is the DLL Name corresponding to the SQL type. DSTableOwner: This is the data source table owner. DSUsername: This is the default user name used for connections (Optional). DSPassword: This is the default password used for connections (Optional).
3 4
Select a component, and then select Start Reconfiguration from the main menu. In the lower applet, query for the following parameter: OM - Named Data Source. NOTE: The value for this parameter is similar to the following: ServerDataSrc,GatewayDataSrc.
5 6 7
Add a comma after the last data source, then add the ISS data source you created in Configuring the Data Source on page 120. The default data source name is ISSDataSrc. Save the record, then select Commit Reconfiguration from the main menu. Repeat Step 3 through Step 6 for all required Object Managers. For example, add ISSDataSrc to the following components:
EAI Object Manager: OM - Named Data Source name UCM Object Manager: OM - Named Data Source name UCM Batch Manager: OM - Named Data Source name Data Quality Manager Workflow Process Batch Manager Workflow Process Manager
8 9
Click Synchronize to save the changes to the Gateway Configuration file. Start the Gateway Server and the Siebel Server.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
12 1
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
For more information about creating action sets, including creating actions for action sets, and associating events with action sets, see Siebel Personalization Administration Guide.
122
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync DeleteRecord Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_ACCOUNT ISS Set BC Name 3 Attribute Set IDS_BC_NAME Set Account ISS Run DQSync 4 BusService DQ Sync Services SyncISS
Name Sequence Action Type Business Service Name Business Service Method
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
12 3
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync PreDeleteRecord Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_ACCOUNT ISS Set ID 3 Attribute Set IDS_IO_ID Set [Id] ISS Set BC Name 4 Attribute Set IDS_BC_NAME Set Account ISS Run DQSync 5 BusService DQ Sync Services SyncISS
ISS Set ID
Name Sequence Action Type Business Service Name Business Service Method
124
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
12 5
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync PreWriteRecord Account Action Set Name of Field Name Sequence Action Type Business Service Name Business Service Method Value ISS Run DQSync 5 BusService DQ Sync Services SyncISS
126
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync WriteRecord Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set BC Name 4 Attribute Set IDS_BC_NAME Set Account ISS Run DQSync 5 BusService DQ Sync Services SyncISS
Name Sequence Action Type Business Service Name Business Service Method
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
12 7
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync DeleteRecord Contact Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_CONTACT ISS Set BC Name 3 Attribute Set IDS_BC_NAME Set Contact ISS Run DQSync 4 BusService DQ Sync Services SyncISS
Name Sequence Action Type Business Service Name Business Service Method
128
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync PreDeleteRecord Contact Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_CONTACT ISS Set ID 3 Attribute Set IDS_IO_ID Set [Id] ISS Set BC Name 4 Attribute Set IDS_BC_NAME Set Contact ISS Run DQSync 5 BusService DQ Sync Services SyncISS
ISS Set ID
Name Sequence Action Type Business Service Name Business Service Method
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
12 9
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
130
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync PreWriteRecord Contact Action Set Name of Field Name Sequence Action Type Business Service Name Business Service Method Value ISS Run DQSync 5 BusService DQ Sync Services SyncISS
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
13 1
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync WriteRecord Contact Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set BC Name 4 Attribute Set IDS_BC_NAME Set Contact ISS Run DQSync 5 BusService DQ Sync Services SyncISS
Name Sequence Action Type Business Service Name Business Service Method
132
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
13 3
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync DeleteRecord Prospect Action Set Name of Field Name Sequence Action Type Business Service Name Business Service Method Value ISS Run DQSync 4 BusService DQ Sync Services SyncISS
Actions in DQ Sync PreDeleteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set siebeldq ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_PROSPECT ISS Set ID 3 Attribute Set IDS_IO_ID Set [Id]
ISS Set ID
134
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync PreDeleteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set BC Name 4 Attribute Set IDS_BC_NAME Set Prospect ISS Run DQSync 5 BusService DQ Sync Services SyncISS
Name Sequence Action Type Business Service Name Business Service Method
Actions in DQ Sync PreWriteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set siebeldq ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_PROSPECT
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
13 5
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync PreWriteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set ID 3 Attribute Set IDS_IO_ID Set [Id] ISS Set BC Name 4 Attribute Set IDS_BC_NAME Set Prospect ISS Run DQSync 5 BusService DQ Sync Services SyncISS
ISS Set ID
Name Sequence Action Type Business Service Name Business Service Method
Actions in DQ Sync WriteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set siebeldq
136
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
Actions in DQ Sync WriteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_PROSPECT ISS Set ID 3 Attribute Set IDS_IO_ID Set [Id] ISS Set BC Name 4 Attribute Set IDS_BC_NAME Set Prospect ISS Run DQSync 5 BusService DQ Sync Services SyncISS
ISS Set ID
Name Sequence Action Type Business Service Name Business Service Method
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
13 7
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications DQ Sync Action Sets
DQ Sync UpdateAddress
Table 51 describes the actions in the DQ Sync UpdateAddress action set. Table 51. Action ISS Run DQSync Actions in DQ Sync Update Address Action Set Name of Field Name Sequence Action Type Business Service Name Business Service Method Value ISS Run DQSync 1 BusService DQ Sync Services SyncISS
DQ Sync WriteRecordNew
Table 52 describes the actions in the DQ Sync WriteRecordNew action set. Table 52. Action ISS UPDATE FLAG Actions in DQ Sync WriteRecordNew Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS UPDATE FLAG 1 Attribute Set IDS_IS_UPDATE Set false
DQ Sync WriteRecordUpdated
Table 53 describes the actions is in the DQ Sync WriteRecordUpdated action set. Table 53. Action ISS UPDATE FLAG Actions in DQ Sync WriteRecordUpdated Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS UPDATE FLAG 1 Attribute Set IDS_IS_UPDATE Set true
138
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Configuring Deduplication Against Multiple Addresses
For contact, you must use the Contact Match Against parameter to specify whether to match using one of the following:
NOTE: You cannot perform deduplication against both All Address and Primary Address. Only one option can be used for deduplication. Choosing to carry out deduplication against all addresses is performance intensive. The following procedure describes how to configure deduplication against multiple addresses. Once configured, deduplication against multiple addresses applies in real-time, Universal Customer Master (UCM) or Enterprise Application Integration (EAI) insertion, and batch match processing modes.
In the BC Vendor Field Mapping, configure the business component operation shown in the following table: Business Component Field CUT Address Operation DeDuplication
In the Vendor Parameter, configure the value shown in the following table: Name CUT Address DeDup Record Type Value CUT Address
In the field mapping for CUT Address, enter the values shown in the following table:
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
13 9
Configuring Data Quality with Oracle Data Quality Servers Configuring Deduplication Against Multiple Addresses
Business Component Field PositionCity Country Postal Code Row Id State Street Address
In your Siebel application, navigate to the Administration - Data Quality screen, then the Data Quality Settings view, and:
a b
In the Value field for the parameters shown in the following table, specify the appropriate settings. Log out of the application and log back in for the changes to take effect (you do not have to restart the Siebel Server). Parameter Account Match Against Description Set to one of the following: All Address to consider all addresses associated with an account for deduplication. Primary Address (the default value). If set to Primary Address, then only the primary address associated with an account is considered for deduplication.
Set to one of the following: All Address to consider all addresses associated with a contact for deduplication. Primary Address (default value). If set to Primary Address, then only the primary address associated with a contact is considered for deduplication.
Set the value of this property Yes. Set the value of this property to Yes.
140
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Configuring Multiple Language Support for Data Matching
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
14 1
Configuring Data Quality with Oracle Data Quality Servers Configuring Multiple Language Support for Data Matching
Figure 2.
NOTE: In order to pick up all the records that belong to the same country when running a data quality batch processing task, it is mandatory to define a search specification (to pick up the records belonging to the same country). You can define a search specification by navigating to the Administration - Server Management screen, then the Jobs view. This feature can be extended as follows: Extended to have different match rules depending on the source of data (for example the Siebel application or other application). Extended to have different match rules depending on the mode of data entry (for example, realtime or batch processing mode). The procedure in Configuring Multiple Mode Support for Data Matching on page 145 describes how to configure multiple mode support for data matching when using the Oracle Data Quality Matching Server for data matching.
Use the following procedure to configure multiple language support for data matching when using the Oracle Data Quality Matching Server for data matching.
To configure the Siebel Server for Informatica Identity Resolution multiple language support 1
Create systems on the Informatica Identity Resolution server.
142
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Configuring Multiple Language Support for Data Matching
Create separate SDF files for each Country (Population). Informatica provides Standard Populations for most of the countries. Standard Populations are distributed as part of SSANAME3 installation and can be copied separately if not selected when installing NAME3 server. NOTE: For more information about installing populations from the Windows Fix CD and adding populations to an existing installation, see the relevant documentation included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud.
Once all populations are in place, check and note the filename of each population, as this is the same name that is used in the SDF file. You can change System Name and System ID within the system definition file as follows: system-definition *================= NAME= siebeldq_XXXX ID= sYY Replace XXXX with Country, and YY with any number between and including 02 and 99. System ID 01 is reserved for Default System. For example, for Japanese population: filename : siebeldq_Japan.sdf Population files : japan.ysp Changes to SDF file: system-definition *================= NAME= siebeldq_Japan ID= s05 System ID 01 is reserved for Default System Replace all occurrences of Population(usa) to Population(japan). Similar changes are required for each sdf file.
Configure the ssadq_cfg.xml configuration file in the <siebsrvr>/SDQConnector folder. For example, add the following parameter to the ssadq_cfg.xml file: <Parameter> <Record_Type> <Name>Account_Japan</Name> <System>siebeldq_Japan</System> <Search>search-org</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter>
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
14 3
Configuring Data Quality with Oracle Data Quality Servers Configuring Multiple Language Support for Data Matching
In your Siebel application, navigate to the Administration - Data Quality screen, then the Data Quality Settings view and in the value field for the parameter shown in the following table, specify the following setting: Parameter Name Enable DQ Multiple Languages Value Yes
a b c
Navigate to Administration - Data Quality screen, then Third Party Administration view in your Siebel application. Select ISS as the third party vendor. Add the vendor parameters shown in the following table: Name Append Account Record Type Field 1 Append Contact Record Type Field 1 Value Primary Account Country Primary Personal Country
a b c
Navigate to Administration - Data Quality screen, then Third Party Administration view in your Siebel application. Select ISS as the third-party vendor. Add the vendor parameters shown in the following table: Name Batch Append Account Record Type Field 1 Batch Append Contact Record Type Field 1 Value Country Primary Personal Country
Add the user property to the ISS System Services business service. The user property that you add must correspond to the system name created in Informatica Identity Resolution for the respective country. For example, if the system created for Japan is siebeldq_Japan and the ID is set to 5, then the user property name must be siebeldq_Japan and the value 05, as shown in the following table. User Property Name siebeldq_Japan Value 05
144
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Configuring Multiple Mode Support for Data Matching
As a prerequisite to configuring multiple mode support for data matching, multiple language support for data matching must be configured as described in Configuring Multiple Language Support for Data Matching on page 141. Follow the steps in the following procedure in order to use different match rules on a custom parameter (Source System). Using this procedure, the match rules that apply to data from source system 1 (EBIZ) will be different to the match rules that apply to data from source system 2 (SIEBEL).
Siebeldq_ebiz Siebeldq_siebel
The SDF file contains the IDT Layout Definition, Key Definition Logic, Match Fields considered for scoring records, the population to be used, and the Match purpose. NOTE: You must contact Informatica Technical Support in order to fine tune the SDF file.
Apply changes to the ssadq_cfg.xml file. For each system that you create in IIR, add the following parameters. There must be two entries, one for each source system (EBIZ and SIEBEL). <Record_Type> <Name>BCNAME_SOURCEFIELDVALUE</Name> <System>SYSTEM_NAME</System> <Search>SEARCH_CRITERIA</Search> <no_of_sessions>100</no_of_sessions> </Record_Type> The following example assumes that the source field is within the Account Business Component. <Parameter> <Record_Type> <Name>Account_EBIZ</Name> <System>SiebelDQ_EBIZ</System> <Search>search-org</Search>
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
14 5
Configuring Data Quality with Oracle Data Quality Servers Sample Configuration Files for Oracle Data Quality Matching Server
<no_of_sessions>100</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Account_SIEBEL</Name> <System>SiebelDQ_SIEBEL</System> <Search>search-org</Search> <no_of_sessions>100</no_of_sessions> </Record_Type> </Parameter>
Navigate to the Administration - Data Quality screen, then the Third Party Administration view, and add the new vendor parameter for ISS as shown in the following table (this example assumes that Account is the Business Component Name, and Source is the Field): Parameter Name Append Account Record Type Field 1 Value Source (this the business component field name where the source information is stored).
ssadq_cfg.xml
The ssadq_cfg.xml file is used by Oracle Data Quality Matching Server. An example ssadq_cfg.xml file follows. <?xml version="1.0" encoding="UTF-16"?> <Data> <Parameter> <iss_host>hostName</iss_host> </Parameter> <Parameter> <iss_port>1666</iss_port> </Parameter> <Parameter> <rulebase_name>odb:0:userName/passWord@connectString</rulebase_name> </Parameter> <Parameter> <id_tag_name>DQ.RowId</id_tag_name> </Parameter>
146
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample Configuration Files for Oracle Data Quality Matching Server
<Parameter> <Record_Type> <Name>Account_Denmark</Name> <System>SiebelDQ_Denmark</System> <Search>search-org</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Account_USA</Name> <System>SiebelDQ_USA</System> <Search>search-org</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Account_Germany</Name> <System>SiebelDQ_Germany</System> <Search>search-org</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Account</Name> <System>SiebelDQ</System> <Search>search-org</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Account_China</Name> <System>siebelDQ_China</System> <Search>search-org</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Account_Japan</Name> <System>siebelDQ_Japan</System> <Search>search-org</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter>
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
14 7
Configuring Data Quality with Oracle Data Quality Servers Sample Configuration Files for Oracle Data Quality Matching Server
<Parameter> <Record_Type> <Name>Contact_Denmark</Name> <System>SiebelDQ_Denmark</System> <Search>search-person-name</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Contact_USA</Name> <System>SiebelDQ_USA</System> <Search>search-person-name</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Contact</Name> <System>SiebelDQ</System> <Search>search-person-name</Search> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Contact_Germany</Name> <System>SiebelDQ_USA</System> <Search>search-person-name</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Contact_China</Name> <System>SiebelDQ_China</System> <Search>search-person-name</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Contact_Japan</Name> <System>SiebelDQ_Japan</System> <Search>search-person-name</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter>
148
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Address Validation Server
<Record_Type> <Name>Prospect</Name> <System>SiebelDQ</System> <Search>search-prospect-name</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Prospect_Denmark</Name> <System>SiebelDQ_Denmark</System> <Search>search-prospect-name</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Prospect_USA</Name> <System>SiebelDQ_USA</System> <Search>search-prospect-name</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Prospect_China</Name> <System>SiebelDQ_China</System> <Search>search-prospect-name</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> <Parameter> <Record_Type> <Name>Prospect_Japan</Name> <System>SiebelDQ_Japan</System> <Search>search-prospect-name</Search> <no_of_sessions>25</no_of_sessions> </Record_Type> </Parameter> </Data>
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
14 9
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Address Validation Server
1 2 3 4
Installing Oracle Data Quality Address Validation Server on page 150 Configuring Siebel Business Applications for the Oracle Data Quality Address Validation Server on page 155 Modifying Configuration Parameters for Oracle Data Quality Address Validation Server on page 151 Acquiring the License Key and Postal Directories for Oracle Data Quality Address Validation Server on page 153
Installing Oracle Data Quality Matching Server on page 92 Configuring Oracle Data Quality Matching Server on page 100
The ssadqasm.dll file uses the Oracle Data Quality Address Validation Server for address cleansing. You need a license to use the Oracle Data Quality Address Validation Server.
Obtain licensing for the postal directories (or postal validation databases), and then: NOTE: The postal directories and the license for the postal directories must be obtained directly from Informatica Address Doctor. For more information, see Acquiring the License Key and Postal Directories for Oracle Data Quality Address Validation Server on page 153. Informatica bundles geographies in different ways - for example, North America is cheaper than USA + Canada + Mexico.
Copy the postal validation databases to the following location: <InstallDir>:/InformaticaIR/ssaas/ad5/ad/db NOTE: This is the postal directory path for Informatica Address Doctor Version 5. For Informatica Address Doctor Version 4, the postal directory path is InformaticaIR/ssaas/ad/ ad/db.
150
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Address Validation Server
Copy the following DLLs from informaticaIR/bin to siebelsrvr/bin: ssasec.dll ssadqasm.dll ssadqsea.dll ssaiok.dll NOTE: Copy these DLLs if using Windows. Copy the libXXXX.so DLLs if using UNIX. For UNIX, the target directory is siebsrvr/lib. For Windows, the target directory is siebsrvr\bin.] Make sure that you copy the DLLs to Siebel Server every time you upgrade or apply a new patch for your Siebel application.
Place the Oracle Data Quality Address Validation Server key file in the /ssaas/ad5/ad/db folder. For example: <InstallDir>:/InformaticaIR/ssaas/ad5/ad/db The key file contains an unlock code for specific databases; Informatica sends the key file along with the postal directories. NOTE: If using Informatica Identity Resolution 9.01, see Upgrading to Informatica Identity Resolution 9.01 on page 108 (Step 2).
a b c d
Navigate to Administration - Data screen, then the List of Values Explorer view. Click Query, and query for the following in the List of Values - Type field: COUNTRY. In the LOV explorer panel, click the COUNTRY node (by clicking the + sign) and navigate to the values for COUNTRY. Add a new entry for UNITED STATES.
Repeat this step for each country where you acquired postal directories. For example, add CANADA to the COUNTRY list of values in the same way, add MEXICO to the COUNTRY list of values in the same way, and so on.
Modifying Configuration Parameters for Oracle Data Quality Address Validation Server
The ssadq_cfgasm.xml file contains the global configuration parameters for Oracle Data Quality Address Validation Server. You must modify ssadq_cfgasm.xml in order to map Siebel CRM business components (<bc_field>) to data types supported by IIR (<data_type>). Table 54 lists the data types that are supported by IIR. For an example ssadq_cfgasm.xml file, see Sample Configuration Files for Oracle Data Quality Matching Server on page 146.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
15 1
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Address Validation Server
Table 54.
Data Types Supported by IIR Meaning For example: Lord, Sir, and so on. For example: Mr. Mrs, Dr, and so on. First name Middle name Last name For example: Manager, Director, and so on. Building name Sub building name House number Street address line 1 Street address line 2 Post office box number For example: URB, Colonia. For example: County For example: State Postal code Country name
Data Type Nobility Title FName MName LName Function Building SubBuilding HouseNumber Street1 Street2 POBox DeptLocality Locality Province Zip Country
In addition to providing field mappings, the ssadq_cfgasm.xml file defines a standardization operation (<std_operation>) for each field, which controls how the field will be standardized. Table 55 lists the standardization operations that are supported by IIR.
Table 55.
Standardization Operations Supported by IIR Description Convert text to upper case. Convert text to lower case. Convert text to camel case (upper case for first letter only)
Use the following procedure to modify configuration parameters for Oracle Data Quality Address Validation Server. This task is a step in Process of Installing the Oracle Data Quality Address Validation Server on page 149.
152
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Address Validation Server
To modify configuration parameters for Oracle Data Quality Address Validation Server 1 2
Open the ssadq_cfgasm.xml file in a text editor. Use the following syntax to map a Siebel CRM business component field name to a supported IIR data type: <Parameter> <datacleanse_mapping> <mapping> <bc_field>AccountName</bc_field> <data_type>Organization</data_type> <std_operation>Camel</std_operation> </mapping> </datacleanse_mapping> </Parameter> This example maps the Siebel CRM business component field name AccountName to the supported IIR Organization data type, and defines camel as the standardization operation. Repeat this step as required.
Save the ssadq_cfgasm.xml file and copy it to the SDQConnector folder on Siebel Server for the changes to take effect: siebsrvr/SDQConnector
To integrate with the Informatica Address Doctor Version 5 postal directories, add the following tag to the ssadqasm_cfg.xml file located in siebelserver/SDQConnector: <Parameter> <asm_version>V5</asm_version> </Parameter>
When enabling data cleansing, you must add the country LOV value according to how the country is returned by the postal directory after cleansing. For example, if Country USA looks like UNITED STATES post cleansing, then you must add the LOV value UNITED STATES to the Country picklist. NOTE: An upgrade from Informatica Identity Resolution 2.7.04 to 2.8.07 should be treated like a new setup. In such cases, install Informatica Identity Resolution 2.8.07 on a new port, create a new system, perform the initial load, start the synchronizer to make it operational, and then delete the current Informatica Identity Resolution 2.7.04 setup.
Acquiring the License Key and Postal Directories for Oracle Data Quality Address Validation Server
The Oracle Data Quality Address Validation Server is installed as part of Informatica Identity Resolution installation. All content and license keys for the postal directories, however, must be purchased directly from Informatica Address Doctor. Subsequent updates and support for the postal directories is provided by Informatica Address Doctor also.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
15 3
Configuring Data Quality with Oracle Data Quality Servers Process of Installing the Oracle Data Quality Address Validation Server
Address Doctor postal directories are currently certified in USA, Canada, and Australia. Address Doctor provides coverage for over 240 countries but not all coverage is the same. Address Doctor assigns a grade (A+, A, B) for each country's coverage. As this grade can change, it is recommended that you check the Address Doctor Web site at the following address for the latest grades: http://www.addressdoctor.com/en/countries_data/countries5.asp License keys, once purchased, provide a 12-month subscription to the postal directories and restrict the use of address validation to the purchased countries or territories. The maximum duration of the license key is 12 months. A postal directory is ultimately owned and managed by the country or territory that provides the postal data, and hence is managed differently across providers. You can keep an eye on postal directory updates by: Verifying the postal reference data on the Address Doctor Web site. Reviewing any update emails that Address Doctor sends.
Use the following procedure to acquire the license key and postal directories for Oracle Data Quality Address Validation Server. This task is a step in Process of Installing the Oracle Data Quality Address Validation Server on page 149.
To acquire the license key and postal directories for Oracle Data Quality Address Validation Server 1
Send the following information to Informatica Address Doctor using the email address [email protected]:
Full customer contact information, including: company name, contact name, email address, billing address, telephone, and fax numbers. The countries, regions, or territories for which you require the license and postal reference data. The platform on which the Oracle Data Quality Address Validation Server is deployed (for example, Oracle Solaris 10 or Windows 32 bit). The underlying Informatica product and version (for example, Informatica Identity Resolution version 2.7 or 2.8).
When you have purchased the license key and postal directories for Oracle Data Quality Address Validation Server:
Informatica Address Doctor emails the license key information to the named contact. Informatica Address Doctor support emails the credentials to download the reference key to the named contact.
You need this information to install and access the postal directories as described in Installing Oracle Data Quality Address Validation Server on page 150.
154
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Configuring Siebel Business Applications for the Oracle Data Quality Address Validation Server
Configuring Siebel Business Applications for the Oracle Data Quality Address Validation Server
Configuring Siebel Business Applications for the Oracle Data Quality Address Validation Server involves enabling cleansing on all object managers, specifying data cleansing settings, and setting up preconfigured vendor parameters and field mapping values for the Oracle Data Quality Address Validation Server. Use the following procedure to configure Siebel Business Applications for the Oracle Data Quality Address Validation Server. This task is a step in Process of Configuring Oracle Data Quality Matching Server (IIR) on page 110.
To configure Siebel Business Applications for the Oracle Data Quality Address Validation Server 1
Open the uagent.cfg file in a text editor, and modify the [DataCleansing] section of the file to include the following: [DataCleansing] Enable=TRUE Type=ASM The uagent.cfg file is located in the Siebel/bin/w32u/enu directory.
In your Siebel application, enable data cleansing to use the Oracle Data Quality Address Validation Server as described in Enabling and Disabling Data Matching and Data Cleansing on page 33. For example, enable data cleansing at the object manager level, enterprise level, user level, and set the data quality settings (for data cleansing). Note that the Data Cleansing Type parameter must be set to ASM as shown in the following table. Parameter Name Data Cleansing Enable Flag Data Cleansing Type Value True ASM
Configure the ASM vendor applet (Oracle Data Quality Address Validation Server vendor applet) as shown in the following table by navigating to the Administration - Data Quality screen, then the Third Party Administration view. Vendor Applet Parameter Name Name DLL Name Vendor Applet Parameter Value ASM ssadqasm
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
15 5
Configuring Data Quality with Oracle Data Quality Servers Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Address Validation Server
Verify that the preconfigured ASM vendor parameter and field mapping values are set up as listed in Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Address Validation Server on page 156. For better control over the data returned by ASM, add the following vendor parameters for Oracle Data Quality Address Validation Server: Vendor Parameter Name ASM Country Database Return Code Description Specifies the ASM return codes under this vendor parameter for which any error messages returned are ignored and processing continues if the country database is not found. List return codes, separated by a comma. For example, if the customer is using Informatica Address Doctor Version 5 and the country database is not licensed, then specify the vendor parameter as follows: ASM Country Database Return Code: 25,26 ASM High Deliverability Return Code Specifies the ASM return codes under this vendor parameter for which addresses returned by the ASM Engine override the input address. If the ASM return code matches a return code defined within this vendor parameter, then the validated address sent by the ASM Engine is cleansed. In all other cases, the input address is retained. List return codes, separated by a comma. For example: ASM High Deliverability Return Code: 0,1,2,3,4,5,6,7,8 This vendor parameter applies only if the DQ Cleanse High Deliverable Address vendor parameter is set to Yes.
Modify the ssadq_cfgasm.xml file as described in Process of Installing the Oracle Data Quality Matching Server on page 90.
Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Address Validation Server
This topic includes information about the Universal Connector parameters and field mapping values for the Oracle Data Quality Address Validation Server. Preconfigured Vendor Parameters for Oracle Data Quality Address Validation Server on page 157
156
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Address Validation Server
Preconfigured Field Mappings for Oracle Data Quality Address Validation Server on page 157
Preconfigured Vendor Parameters for Oracle Data Quality Address Validation Server
Table 56 lists the vendor parameters preconfigured for Oracle Data Quality Address Validation Server. You must not reconfigure the parameter settings.
Preconfigured Vendor Parameters for Oracle Data Quality Address Validation Server Value Account Contact Prospect Business Address Business Address No Yes "global", "iss-config-file", "ssadq_cfgasm.xml"
Account DataCleanse Record Type Contact DataCleanse Record Type List Mgmt Prospective Contact DataCleanse Record Type Personal Address DataCleanse Record Type CUT Address DataCleanse Record Type DQ Send Empty Field To Third Party Vendor DQ Cleanse High Deliverable Address Parameter 1
Preconfigured Field Mappings for Oracle Data Quality Address Validation Server
This topic includes information about the preconfigured Oracle Data Quality Address Validation Server field mappings for the following business components: Preconfigured Field Mappings for Business Component - Account Preconfigured Field Mappings for Business Component - Contact Preconfigured Field Mappings for Business Component - List Mgmt Prospective Contact Preconfigured Field Mappings for Business Component - CUT Address Preconfigured Field Mappings for Business Component - Personal Address
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
15 7
Configuring Data Quality with Oracle Data Quality Servers Universal Connector Parameter and Field Mapping Values for Oracle Data Quality Address Validation Server
Table 57.
Preconfigured Field Mappings for Oracle Data Quality Address Validation Server Business Component - Account Mapped Field Account.Name
Table 58.
Preconfigured Field Mappings for Oracle Data Quality Address Validation Server Business Component - Contact Mapped Field Contact.First Name Contact.Last Name Contact.Middle Name
Preconfigured Field Mappings for Business Component - List Mgmt Prospective Contact
Table 59 shows the data cleansing field mappings for the List Mgmt Prospective Contact business component and data cleansing operation.
Table 59.
Preconfigured Field Mappings for Oracle Data Quality Address Validation Server Business Component - List Mgmt Prospective Contact Mapped Field List Mgmt Prospective Contact.First Name List Mgmt Prospective Contact.Job Title List Mgmt Prospective Contact.Last Name List Mgmt Prospective Contact.Middle Name
Business Component Field First Name Job Title Last Name Middle Name
158
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample Configuration File for Oracle Data Quality Address Validation Server
Table 60.
Preconfigured Field Mappings for Oracle Data Quality Address Validation Server Business Component - CUT Address Mapped Field Business Address.City Business Address.Country Business Address.Postal Code Business Address.State Business Address.Street Address Business Address.Street Address 2
Business Component Field City Country Postal Code State Street Address Street Address 2
Table 61.
Preconfigured Field Mappings for Oracle Data Quality Address Validation Server Business Component - Personal Address Mapped Field Business Address.City Business Address.Country Business Address.Postal Code Business Address.State Business Address.Street Address Business Address.Street Address 2
Business Component Field City Country Postal Code State Street Address Street Address 2
Sample Configuration File for Oracle Data Quality Address Validation Server
This topic provides an example of the ssadq_cfgasm file which is the configuration file used by Oracle Data Quality Address Validation Server.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
15 9
Configuring Data Quality with Oracle Data Quality Servers Sample Configuration File for Oracle Data Quality Address Validation Server
ssadq_cfgasm.xml
The ssadq_cfgasm.xml file is used by Oracle Data Quality Address Validation Server. An example ssadq_cfgasm.xml file follows. <?xml version="1.0" encoding="UTF-8"?> <Data> <Parameter> <iss_host>hostname</iss_host> </Parameter> <Parameter> <iss_port>1666</iss_port> </Parameter> <Parameter> <format_zip>TRUE</format_zip> </Parameter> <Parameter> <datacleanse_mapping> <mapping> <field>Name</field> <ssafield>Organization</ssafield> <std_operation>Upper</std_operation> </mapping> <mapping> <field>Street_spcAddress</field> <ssafield>Street1</ssafield> <std_operation>Upper</std_operation> </mapping> <mapping> <field>City</field> <ssafield>Locality</ssafield> </mapping> <mapping> <field>Postal_spcCode</field> <ssafield>Zip</ssafield> </mapping> <mapping> <field>State</field> <ssafield>Province</ssafield> </mapping> <mapping> <field>Country</field> <ssafield>Country</ssafield> </mapping> <mapping> <field>First_spcName</field> <ssafield>FName</ssafield> <std_operation>Upper</std_operation> </mapping> <mapping>
160
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
<field>Middle_spcName</field> <ssafield>MName</ssafield> <std_operation>Upper</std_operation> </mapping> <mapping> <field>Last_spcName</field> <ssafield>LName</ssafield> <std_operation>Upper</std_operation> </mapping> <mapping> <field>Personal_spcPostal_spcCode</field> <ssafield>Zip</ssafield> </mapping> <mapping> <field>Personal_spcCity</field> <ssafield>Locality</ssafield> </mapping> <mapping> <field>Personal_spcState</field> <ssafield>Province</ssafield> </mapping> <mapping> <field>Personal_spcStreet_spcAddress</field> <ssafield>Street1</ssafield> <std_operation>Camel</std_operation> </mapping> <mapping> <field>Personal_spcStreet_spcAddress 2</field> <ssafield>Street2</ssafield> <std_operation>Camel</std_operation> </mapping> <mapping> <field>Personal_spcCountry</field> <ssafield>Country</ssafield> </mapping> </datacleanse_mapping> </Parameter> </Data>
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
16 1
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
This topic introduces the following ISSSYNC action sets for Siebel applications: ISSSYNC Action Sets for Account on page 162 ISSSYNC Action Sets for Contact on page 169 ISSSYNC Action Sets for List Mgmt Prospective Contact on page 175 Generic ISSSYNC Action Sets on page 183
ISSLoad Account
Table 62 describes the actions in the ISSLoad Account action set.
Actions in ISSLoad Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ ISS Set Page Size 2 Attribute Set IDS_PAGE_SIZE Set 80
162
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSLoad Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set File Name 3 Attribute Set IDS_LOADFILE Set "C:\ids\iss2704s\ids\data\account.xml" NOTE: Modify this value if you install Oracle Data Quality Matching Server on a drive other than C:\ drive.
ISS Set IDT Name 4 Attribute Set IDS_IDT Set IDS_01_IDT_ACCOUNT ISS Set IO Name 5 Attribute Set IDS_IO_NAME Set ISS_Account ISS Run WF 6 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch Build Load File"
Set IO Name
ISS Run WF
Name Sequence Action Type Business Service Name Business Service Method Business Service Context
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
16 3
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC DeleteRecord Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set URL 1 Attribute Set IDS_URL Set "http://SERVERNAME:1671" NOTE: Replace SERVERNAME with the Hostname or IP address of the computer where XML Sync Server (XS) is installed.
ISS Run WF
Name Sequence Action Type Business Service Name Business Service Method Business Service Context
ISS Run WF 2 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch Delete Record Sync"
Actions in ISSSYNC PreDeleteRecord Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ
164
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC PreDeleteRecord Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_ACCOUNT ISS Set IO Name 3 Attribute Set IDS_IO_NAME Set ISS_Account ISS Set ID 4 Attribute Set IDS_IO_ID Set [Id] ISS Run WF 5 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch PreDelete Record Sync"
ISS Set ID
ISS Run WF
Name Sequence Action Type Business Service Name Business Service Method Business Service Context
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
16 5
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC PreWriteRecord Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_ACCOUNT ISS Set IO Name 3 Attribute Set IDS_IO_NAME Set ISS_Account ISS Set ID 4 Attribute Set IDS_IO_ID Set [Id]
ISS Set ID
166
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC PreWriteRecord Account Action Set Name of Field Name Sequence Action Type Business Service Name Business Service Method Business Service Context Value ISS Run WF 5 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch PreWrite Record Sync"
ISS Run WF
Actions in ISSSYNC WriteRecord Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_ACCOUNT ISS Set IO Name 3 Attribute Set IDS_IO_NAME Set ISS_Account
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
16 7
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC WriteRecord Account Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set ID 4 Attribute Set IDS_IO_ID Set [Id] ISS Set URL 5 Attribute Set IDS_URL Set "http://SERVERNAME:1671" NOTE: Replace SERVERNAME with the Hostname or IP address of the computer where XML Sync Server (XS) is installed.
ISS Set ID
ISS Run WF
Name Sequence Action Type Business Service Name Business Service Method Business Service Context
ISS Run WF 6 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch Write Record Sync"
168
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
ISSLoad Contact
Table 67 describes the actions in the ISSLoad Contact action set.
Actions in ISSLoad Contact Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ ISS Set Page Size 2 Attribute Set IDS_PAGE_SIZE Set 80
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
16 9
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSLoad Contact Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set File Name 3 Attribute Set IDS_LOADFILE Set "C:\ids\iss2704s\ids\data\contact.xml" NOTE: Modify this value if you install Oracle Data Quality Matching Server on a drive other than C:\ drive.
ISS Set IDT Name 4 Attribute Set IDS_IDT Set IDS_01_IDT_CONTACT ISS Set IO Name 5 Attribute Set IDS_IO_NAME Set ISS_Contact ISS Run WF 6 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch Build Load File"
ISS Run WF
Name Sequence Action Type Business Service Name Business Service Method Business Service Context
170
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC PreDeleteRecord Contact Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
17 1
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC PreDeleteRecord Contact Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_CONTACT ISS Set IO Name 3 Attribute Set IDS_IO_NAME Set ISS_Contact ISS Set ID 4 Attribute Set IDS_IO_ID Set [Id] ISS Run WF 5 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch PreDelete Record Sync"
ISS Set ID
ISS Run WF
Name Sequence Action Type Business Service Name Business Service Method Business Service Context
172
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC PreWriteRecord Contact Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_CONTACT ISS Set IO Name 3 Attribute Set IDS_IO_NAME Set ISS_Contact ISS Set ID 4 Attribute Set IDS_IO_ID Set [Id]
ISS Set ID
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
17 3
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC PreWriteRecord Contact Action Set Name of Field Name Sequence Action Type Business Service Name Business Service Method Business Service Context Value ISS Run WF 5 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch PreWrite Record Sync"
ISS Run WF
Actions in ISSSYNC WriteRecord Contact Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_CONTACT ISS Set IO Name 3 Attribute Set IDS_IO_NAME Set ISS_Contact
174
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC WriteRecord Contact Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set ID 4 Attribute Set IDS_IO_ID Set [Id] ISS Set URL 5 Attribute Set IDS_URL Set "http://SERVERNAME:1671" NOTE: Replace SERVERNAME with the Hostname or IP address of the computer where XML Sync Server (XS) is installed.
ISS Set ID
ISS Run WF
Name Sequence Action Type Business Service Name Business Service Method Business Service Context
ISS Run WF 6 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch Write Record Sync"
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
17 5
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
ISSLoad Prospect
Table 72 describes the actions in the ISSLoad Prospect action set. Table 72. Action ISS Set System Name Actions in ISSLoad Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value ISS Set Page Size Name Sequence Action Type Profile Attribute Set Operator Value ISS Set File Name Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ ISS Set Page Size 2 Attribute Set IDS_PAGE_SIZE Set 80 ISS Set File Name 3 Attribute Set IDS_LOADFILE Set "C:\ids\iss2704s\ids\data\prospect.xml" NOTE: Modify this value if you install Oracle Data Quality Matching Server on a drive other than C:\ drive. ISS Set IDT Name Name Sequence Action Type Profile Attribute Set Operator Value ISS Set IDT Name 4 Attribute Set IDS_IDT Set IDS_01_IDT_PROSPECT
176
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSLoad Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set IO Name 5 Attribute Set IDS_IO_NAME Set ISS_List_Mgmt_Prospective_Contact ISS Run WF 6 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch Build Load File"
ISS Run WF
Name Sequence Action Type Business Service Name Business Service Method Business Service Context
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
17 7
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC DeleteRecord Prospect Action Set Name of Field Name Sequence Action Type Business Service Name Business Service Method Business Service Context Value ISS Run WF 2 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch Delete Record Sync"
ISS Run WF
Actions in ISSSYNC PreDeleteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_PROSPECT ISS Set IO Name 3 Attribute Set IDS_IO_NAME Set ISS_List_Mgmt_Prospective_Contact
178
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC PreDeleteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set ID 4 Attribute Set IDS_IO_ID Set [Id] ISS Run WF 5 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch PreDelete Record Sync"
ISS Set ID
ISS Run WF
Name Sequence Action Type Business Service Name Business Service Method Business Service Context
Actions in ISSSYNC PreWriteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_PROSPECT
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
17 9
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC PreWriteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set IO Name 3 Attribute Set IDS_IO_NAME Set ISS_List_Mgmt_Prospective_Contact ISS Set ID 4 Attribute Set IDS_IO_ID Set [Id] ISS Run WF 5 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch PreWrite Record Sync"
ISS Set ID
ISS Run WF
Name Sequence Action Type Business Service Name Business Service Method Business Service Context
Actions in ISSSYNC WriteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set System Name 1 Attribute Set IDS_SYSTEM Set SiebelDQ
180
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC WriteRecord Prospect Action Set Name of Field Name Sequence Action Type Profile Attribute Set Operator Value Value ISS Set IDT Name 2 Attribute Set IDS_IDT Set IDS_01_IDT_PROSPECT ISS Set IO Name 3 Attribute Set IDS_IO_NAME Set ISS_List_Mgmt_Prospective_Contact ISS Set ID 4 Attribute Set IDS_IO_ID Set [Id] ISS Set URL 5 Attribute Set IDS_URL Set "http://SERVERNAME:1671" NOTE: Replace SERVERNAME with the Hostname or IP address of the computer where XML Sync Server (XS) is installed.
ISS Set ID
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
18 1
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
Actions in ISSSYNC WriteRecord Prospect Action Set Name of Field Name Sequence Action Type Business Service Name Business Service Method Value ISS Run WF 6 BusService Workflow Process Manager RunProcess "ProcessName", "ISS Launch Write Record Sync"
ISS Run WF
182
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Siebel Business Applications ISSSYNC Action Sets
ISSSYNC WriteRecordNew
Table 77 describes the actions in the ISSSYNC WriteRecordNew action set. Table 77. Action ISS Run WF Actions in ISSSYNC WriteRecordNew Action Set Name of Field Name Sequence Action Type Business Service Name Business Service Method Business Service Context Value ISS Run WF 1 BusService Workflow Process Manager RunProcess "ProcessName", "ISS WriteRecordNew"
ISSSYNC WriteRecordUpdated
Table 78 describes the actions is in the ISSSYNC WriteRecordUpdated action set. Table 78. Action ISS Run WF Actions in ISSSYNC WriteRecordUpdated Action Set Name of Field Name Sequence Action Type Business Service Name Business Service Method Business Service Context Value ISS Run WF 1 BusService Workflow Process Manager RunProcess "ProcessName", "ISS WriteRecordUpdated"
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
18 3
Configuring Data Quality with Oracle Data Quality Servers Activating Action Sets in Your Siebel Business Application
Siebel Business Applications ISSSYNC Action Sets on page 161 Siebel Business Applications ISSSYNC Action Sets on page 161
For more information about creating action sets, including creating actions for action sets, see Siebel Personalization Administration Guide. NOTE: When verifying ISSSYNC action set setup, make sure that the IDS_URL profile attribute reflects the URL location of Oracle Data Quality Matching Server.
Verify that appropriate run-time events (seed data) are set up in your Siebel application by navigating to Administration - Runtime Events, then the Events view. For more information about run-time events, including how to call a workflow process from a runtime event, see Siebel Business Process Framework: Workflow Guide. For more information about associating events with action sets, see Siebel Personalization Administration Guide.
Activate the action sets for Account, Contact, and List Mgmt Prospective Contact, as follows:
a b c
Navigate to Administration - Runtime Events, then the Action Sets view. Select the Active checkbox for each Action Set that you want to activate. Reload the run-time events by clicking Menu, and selecting Reload Runtime Events.
184
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
IDS_IDT_CURRENT_BATCH_PROSPECT.SQL on page 190 IDS_IDT_LOAD_ANY_ENTITY.CMD on page 190 (Windows) IDS_IDT_LOAD_ANY_ENTITY.sh on page 192 (UNIX)
IDS_IDT_ACCOUNT_STG.SQL
The following sample SQL script can be used for incremental data load. /* '============================================================================' ' Need to change TBLO before executing the scripts on target database. ' '============================================================================' */ SET SET SET SET SET SET SET TERMOUT ON FEEDBACK OFF VERIFY OFF TIME OFF TIMING OFF ECHO OFF PAUSE OFF
DROP MATERIALIZED VIEW ACCOUNTS_SNAPSHOT_VIEW; CREATE MATERIALIZED VIEW ACCOUNTS_SNAPSHOT_VIEW AS SELECT T2.ROW_ID ACCOUNT_ID, T2.NAME ACCOUNT_NAME, T3.ROW_ID ACCOUNT_ADDR_ID, T3.ADDR ADDRESS_LINE1, T3.ADDR_LINE_2 ADDRESS_LINE2, T3.COUNTRY COUNTRY, T3.STATE STATE, T3.CITY CITY, T3.ZIPCODE POSTAL_CODE, DECODE(T2.PR_BL_ADDR_ID,T3.ROW_ID,'Y','N') PRIMARY_FLAG, FLOOR((ROWNUM-1)/&BATCH_SIZE)+1 BATCH_NUM FROM dbo.S_CON_ADDR T1, dbo.S_ORG_EXT T2, dbo.S_ADDR_PER T3 WHERE T1.ACCNT_ID = T2.ROW_ID AND T1.ADDR_PER_ID = T3.ROW_ID -- Comment the following line for Multiple address match option -AND T2.PR_BL_ADDR_ID=T3.ROW_ID /
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
18 5
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
SELECT '============================================================================' || CHR(10) || ' REPORT ON ACCOUNTS SNAPSHOT' || CHR(10) || '============================================================================' || CHR(10) " " FROM DUAL / SELECT BATCH_NUM BATCH, COUNT(*) "NUMBER OF RECORDS" FROM ACCOUNTS_SNAPSHOT_VIEW GROUP BY BATCH_NUM ORDER BY BATCH_NUM /
IDS_IDT_CONTACT_STG.SQL
The following sample SQL script can be used for incremental data load. /* ============================================================================ Need to change TBLO before executing the scripts on target database. ============================================================================ */ SET SET SET SET SET SET SET SET TERMOUT ON FEEDBACK OFF VERIFY OFF TIME OFF TIMING OFF ECHO OFF PAUSE OFF PAGESIZE 50
DROP MATERIALIZED VIEW CONTACTS_SNAPSHOT_VIEW; CREATE MATERIALIZED VIEW CONTACTS_SNAPSHOT_VIEW AS SELECT T1.CONTACT_IDCONTACT_ID, T2.FST_NAME || ' ' || LAST_NAMENAME, T2.MID_NAMEMIDDLE_NAME, T3.ROW_ID ADDRESS_ID, T3.CITY CITY, T3.COUNTRY COUNTRY, T3.ZIPCODE POSTAL_CODE, T3.STATE STATE, T3.ADDR STREETADDRESS, T3.ADDR_LINE_2ADDRESS_LINE2, DECODE(T2.PR_PER_ADDR_ID,T3.ROW_ID,'Y','N')PRIMARY_FLAG, T4.NAME ACCOUNT, T2.BIRTH_DTBirthDate, T2.CELL_PH_NUM CellularPhone,
186
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
T2.EMAIL_ADDR EmailAddress, T2.HOME_PH_NUM HomePhone, T2.SOC_SECURITY_NUM SocialSecurityNumber, T2.WORK_PH_NUM WorkPhone, FLOOR((ROWNUM-1)/&BATCHSIZE)+1BATCH_NUM FROM dbo.S_CON_ADDR T1, dbo.S_CONTACT T2, dbo.S_ADDR_PER T3, dbo.S_ORG_EXT T4 WHERE T1.CONTACT_ID= T2.ROW_ID AND T1.ADDR_PER_ID = T3.ROW_ID AND -OR (T1.ADDR_PER_ID IS NULL)) Do we need contacts with no address? --Comment the following line for Multiple address match option -T2.PR_PER_ADDR_ID = T3.ROW_ID (+) AND T2.PR_DEPT_OU_ID = T4.PAR_ROW_ID (+) / SELECT '============================================================================' || CHR(10) || ' REPORT ON CONTACTS SNAPSHOT' || CHR(10) || '============================================================================' || CHR(10) " " FROM DUAL / SELECT BATCH_NUM BATCH, COUNT(*) "NUMBER OF RECORDS" FROM CONTACTS_SNAPSHOT_VIEW GROUP BY BATCH_NUM ORDER BY BATCH_NUM /
IDS_IDT_PROSPECT_STG.SQL
The following sample SQL script can be used for incremental data load. /* '============================================================================' ' Need to change TBLO before executing the scripts on target database. ' '============================================================================' */ SET SET SET SET SET SET SET SET TERMOUT ON FEEDBACK OFF VERIFY OFF TIME OFF TIMING OFF ECHO OFF PAUSE OFF PAGESIZE 50
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
18 7
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
DROP MATERIALIZED VIEW PROSPECTS_SNAPSHOT_VIEW; CREATE MATERIALIZED VIEW PROSPECTS_SNAPSHOT_VIEW AS SELECT CON_PR_ACCT_NAME ACCOUNT_NAME, CELL_PH_NUMCELLULAR_PHONE, CITY CITY, COUNTRY COUNTRY, EMAIL_ADDREMAIL_ADDRESS, FST_NAME || ' ' || LAST_NAME NAME, HOME_PH_NUMHOME_PHONE, MID_NAMEMIDDLE_NAME, ZIPCODE POSTAL_CODE, SOC_SECURITY_NUMSOCIAL_SECURITY_NUMBER, STATE STATE, ADDR STREETADDRESS, ADDR_LINE_2 ADDRESS_LINE2, WORK_PH_NUMWORK_PHONE, ROW_ID PROSPECT_ID, FLOOR((ROWNUM-1)/&BATCH_SIZE)+1BATCH_NUM FROM dbo.S_PRSP_CONTACT T2 / SELECT '============================================================================' || CHR(10) || ' REPORT ON PROSPECTS SNAPSHOT' || CHR(10) || '============================================================================' || CHR(10) " " FROM DUAL / SELECT BATCH_NUM BATCH, COUNT(*) "NUMBER OF RECORDS" FROM PROSPECTS_SNAPSHOT_VIEW GROUP BY BATCH_NUM ORDER BY BATCH_NUM /
IDS_IDT_CURRENT_BATCH.SQL
The following sample SQL script can be used for incremental data load. SET FEEDBACK ON DROP TABLE IDS_IDT_CURRENT_BATCH / CREATE TABLE IDS_IDT_CURRENT_BATCH ( BATCH_NUM INTEGER) / INSERT INTO IDS_IDT_CURRENT_BATCH VALUES (1) /
188
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
IDS_IDT_CURRENT_BATCH_ACCOUNT.SQL
The following sample SQL script can be used for incremental data load. CREATE OR REPLACE VIEW INIT_LOAD_ALL_ACCOUNTS AS SELECT ACCOUNT_ID, ACCOUNT_NAME, ACCOUNT_ADDR_ID, ADDRESS_LINE1, ADDRESS_LINE2, COUNTRY, STATE, CITY, POSTAL_CODE, PRIMARY_FLAG FROM ACCOUNTS_SNAPSHOT_VIEW WHERE BATCH_NUM= (SELECT BATCH_NUM FROM IDS_IDT_CURRENT_BATCH) /
IDS_IDT_CURRENT_BATCH_CONTACT.SQL
The following sample SQL script can be used for incremental data load. CREATE OR REPLACE VIEW INIT_LOAD_ALL_CONTACTS AS SELECT CONTACT_ID, NAME, MIDDLE_NAME, ADDRESS_ID, CITY, COUNTRY, POSTAL_CODE, STATE, STREETADDRESS, ADDRESS_LINE2, PRIMARY_FLAG, BirthDate, CellularPhone, EmailAddress, HomePhone, SocialSecurityNumber, WorkPhone, ACCOUNT FROM CONTACTS_SNAPSHOT_VIEW
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
18 9
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
IDS_IDT_CURRENT_BATCH_PROSPECT.SQL
The following sample SQL script can be used for incremental data load. CREATE OR REPLACE VIEW INIT_LOAD_ALL_PROSPECTS AS SELECT ACCOUNT_NAME, CELLULAR_PHONE, CITY, COUNTRY, EMAIL_ADDRESS, NAME, HOME_PHONE, MIDDLE_NAME, POSTAL_CODE, SOCIAL_SECURITY_NUMBER, STATE, STREETADDRESS, WORK_PHONE, PROSPECT_ID FROM PROSPECTS_SNAPSHOT_VIEW WHERE BATCH_NUM= (SELECT BATCH_NUM FROM IDS_IDT_CURRENT_BATCH) /
IDS_IDT_LOAD_ANY_ENTITY.CMD
The following sample SQL script can be used for incremental data load. NOTE: Use this file for Microsoft Windows. @echo off REM REM REM REM REM REM ************************************************************************ * * * 1. Change informaticaHome to point to your IIR installation folder * * 2. Change initLoadScripts to point to your Initial Load scripts * * * ************************************************************************
190
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
if %1.==. goto Error if %2.==. goto Error if %3.==. goto Error NOT %4.==. goto GIvenBatchOnly REM REM REM REM REM set set set set set set ************************************************************************ * * * Setting parameters * * * ************************************************************************ current=%1 workdir=%2 dbcredentials=%3 machineName=%computername% informaticaHome=C:\InformaticaIR initLoadScripts=C:\InformaticaIR\InitLoadScripts
REM ************************************************************************ REM * * REM * Find the number of batches in the current Entity records snapshot * REM * * REM ************************************************************************ FOR /F "usebackq delims=!" %%i IN (`sqlplus -s %dbcredentials% @GetBatchCount%1`) DO set xresult=%%i set /a NumBatches=%xresult% echo %NumBatches% del /s/f/q %workdir%\* setlocal enabledelayedexpansion set /a counter=1 REM ************************************************************************ REM * * REM * Loop through all the batches * REM * * REM ************************************************************************ for /l %%a in (2, 1, !NumBatches!) do ( set /a counter += 1 (echo counter=!counter!) sqlplus %dbcredentials% @%initLoadScripts%\SetBatchNumber.sql !counter! cd /d %informaticaHome%\bin idsbatch -h%machineName%:1669 -i%initLoadScripts%\idt_%current%_load.txt 1%workdir%\idt_%current%_load!counter!.log -2%workdir%\idt_%current%_load!counter!.err -3%workdir%\idt_%current%_load!counter!.dbg ) goto DONE :GivenBatchOnly echo Processing Batch %4.... sqlplus %dbcredentials% @%initLoadScripts%\SetBatchNumber.sql %4 cd /d %informaticaHome%\bin
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
19 1
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
idsbatch -h%machineName%:1669 -i%initLoadScripts%\idt_%current%_load.txt 1%workdir%\idt_%current%_load%4.log -2%workdir%\idt_%current%_load%4.err 3%workdir%\idt_%current%_load%4.dbg goto DONE :Error ECHO Insufficient parameters echo usage "IDS_IDT_LOAD_ANY_ENTITY.CMD <Object_Name> <Work_Dir> <DBUser/ DBPassword@TNS_Entry_Name> [Optional Batch Number]" ECHO echo e.g. IDS_IDT_LOAD_ANY_ENTITY.CMD ACCOUNT C:\InformaticaIR\InitLoadScripts ora1234/ora1234@ora_db GOTO END :DONE Echo Process completed. Please examine error and log files in %workdir% OFF set /a errorcount = 0 %%R in (*.err) do if %%~zR neq 0 set /a errorcount += 1 error Count =%errorcount% if %errorcount% neq 0 goto batcherror GOTO END :batcherror echo %errorcount% batch/es have failed. Please check the following batches: for %%R in (*.err) do if %%~zR neq 0 echo %%R goto DONE :END
IDS_IDT_LOAD_ANY_ENTITY.sh
The following sample SQL script can be used for incremental data load. NOTE: Use this file for UNIX. #!/bin/bash ################################################################################# # Prerequisite check block # ################################################################################# # Checking IIR system variables are set. If not then throw error and exit. if [ -z "$SSABIN" ] && [ -z "$SSATOP" ] then echo "Err #LOAD-01: Informatica IIR system variables not set. Please use 'idsset' script" exit else # checking if required idsbatch utility exists at $SSABIN location if [ -f $SSABIN/idsbatch ] then echo "idsbatch utility found." fi
192
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
fi
################################################################################# # Param block # ################################################################################# # INPUT PARAMETERS current=$1 workdir=$2 dbcredentials=$3 # ENVIRONMENT RELATED PARAMETERS # scriptdir=/export/home/qa1/InformaticaIR/initloadscripts # informaticadir=/export/home/qa1/InformaticaIR scriptdir=$SSATOP/initloadscripts =$SSATOP # DEBUG OPTION - 1 for ON and 0 for OFF debug=1 # Passing DB credentials as argument # ISS DATABASE CREDENTIALS and CONNECT INFO # dbcredentials=ora32155/ora32155@sdchs20n532_qa532a dbcredentials=$3 # MACHINE NAME machineName=`hostname` ################################################################################# # Execution block # ################################################################################# if [ $debug -eq 1 ]; then echo using Script dir: $scriptdir using Informatica Home: $informaticadir fi if [ $# -lt 3 ] then echo "Err #LOAD-03: Error in $0 - Invalid Argument Count" echo Usage LoadAnyEntity "<Entity Account,Contact or Prospect> <WorkDir> <dbuser/ dbpass@tnsname> [Optional Batch Number]" echo Insufficient parameters echo e.g "Syntax: $0 Entity_name Account Log_directory /temp" exit fi if [ -f $scriptdir/idt_$current\_load.txt ] then if [ $debug -eq 1 ]; then echo Using Load file $scriptdir/idt_$current\_load.txt
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
19 3
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
fi else Load file cannot be loaded. Please check and rerun process fi if [ $# -eq 4 ] then Specific bath to be loaded: $4 fi if [ -d $workdir ]; then cd $workdir rm -r -f *.err fi if [ $# -eq 3 ] then read_sql_stmt() { typeset stmt=$1 typeset login=$2 echo " set feedback off verify off heading off pagesize 0 $stmt; exit " | sqlplus -s $login } read_sql_stmt "select max(batch_num) from "$current"s_SNAPSHOT_VIEW" "$dbcredentials" | while read u do batches=$u counter=2 if [ $debug -eq 1 ]; then echo current=$current echo workdir=$workdir echo counter=$counter echo number of batches to be processed is: $batches fi # for counter in $(seq 2 $batches); for ((counter=2; counter <= $batches; counter++)); currentbatch=$( sqlplus -S $dbcredentials <<! set head off feedback off echo off pages 0 UPDATE IDS_IDT_CURRENT_BATCH set batch_num=$counter / select batch_num from IDS_IDT_CURRENT_BATCH /
194
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
! echo echo echo "#########################################" echo "# Curently Processing Batch: $currentbatch #" echo "#########################################" cd $informaticadir/bin if [ $debug -eq 1 ]; then echo InformaticaDrive: ${PWD} echo Processing following command: echo idsbatch -h$machineName:1669 -i$scriptdir/idt_$current\_load.txt -1$workdir/ idt_$current\_load$counter.log -2$workdir/idt_$current\_load$counter.err 3$workdir\idt_$current\_load$counter.dbg echo "#########################################" fi idsbatch -h$machineName:1669 -i$scriptdir/idt_$current\_load.txt -1$workdir/ idt_$current\_load$counter.log -2$workdir/idt_$current\_load$counter.err 3$workdir\idt_$current\_load$counter.dbg done done else counter=$4 echo "#########################################" echo Processing Batch $4.... currentbatch=$( sqlplus -S $dbcredentials <<! set head off feedback off echo off pages 0 UPDATE IDS_IDT_CURRENT_BATCH set batch_num=$counter / select batch_num from IDS_IDT_CURRENT_BATCH / ! ) echo "#########################################" cd $informaticadrive/bin idsbatch -h$machineName:1669 -i$scriptdir/idt_$current\_load.txt -1$workdir/ idt_$current\_load$counter.log -2$workdir/idt_$current\_load$counter.err 3$workdir\idt_$current\_load$counter.dbg fi echo "Process completed. Please examine error and log files in "$workdir # errorcnt=0 if [ -d $workdir ]; then cd $workdir fi errorcnt=$(find ./ -depth 1 -name "*.err" ! -size 0 | wc -l) echo Errors encountered is: $errorcnt if [ $errorcnt -eq 0 ]; then echo Successfully processed all the batches else
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
19 5
Configuring Data Quality with Oracle Data Quality Servers Sample SQL Scripts
######################################### # Failed batch report # ######################################### $errorcnt batch/es have failed. Please check the following batches: ./ -depth 1 -name "*.err"
196
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample SiebelDQ.sdf File
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
19 7
Configuring Data Quality with Oracle Data Quality Servers Sample SiebelDQ.sdf File
OPTIONS= No-Null-Key, Compress-Key-Data(150) * idx-definition *============= NAME= IDX_CONTACT_ADDR ID= 3s IDT-NAME= IDT_CONTACT KEY-LOGIC= SSA, System(default), Population(usa), Controls("FIELD=Address_part1 KEY_LEVEL=Standard"), Field(StreetAddress), Null-Key("K$$$$$$$") OPTIONS= No-Null-Key, Compress-Key-Data(150) * idx-definition *============= NAME= IDX_CONTACT_ORG ID= 4s IDT-NAME= IDT_CONTACT KEY-LOGIC= SSA, System(default), Population(usa), Controls("FIELD=Organization_Name KEY_LEVEL=Standard"), Field(Account), Null-Key("K$$$$$$$") OPTIONS= No-Null-Key, Compress-Key-Data(150) * idx-definition *============= NAME= IDX_PROSPECT ID= 5s IDT-NAME= IDT_PROSPECT KEY-LOGIC= SSA, System(default), Population(usa), Controls("FIELD=Person_Name KEY_LEVEL=Standard"), Field(Name), Null-Key("K$$$$$$$") OPTIONS= No-Null-Key, Compress-Key-Data(150) * * ********************************************************************* * Loader and Job Definitions for Initial Load. You can remove the parameter OPTIONS=APPEND, if you are not doing an incremental load ********************************************************************* * loader-definition *==================== NAME= All_Load
198
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample SiebelDQ.sdf File
JOB-LIST= job-account, job-contact, job-prospect * loader-definition *==================== NAME= siebel_prospect JOB-LIST= job-prospect OPTIONS= APPEND * loader-definition *==================== NAME= siebel_contact JOB-LIST= job-contact OPTIONS= APPEND * loader-definition *==================== NAME= siebel_account JOB-LIST= job-account OPTIONS= APPEND * job-definition *============= NAME= job-account FILE= lf-input-account IDX= IDX_ACCOUNT * job-definition *============= NAME= job-contact FILE= lf-input-contact IDX= IDX_CONTACT_NAME OPTIONS= Load-All-Indexes * job-definition *============= NAME= job-prospect FILE= lf-input-prospect IDX= IDX_PROSPECT * * logical-file-definition *====================== NAME= lf-input-account PHYSICAL-FILE= IDT_ACCOUNT *PHYSICAL-FILE= "+/data/account.xml" ************* * If Loading directly from Table, set PHYSICAL-FILE as Table Name,If loading from xml file set PHYSICAL-FILE as XML file name ************* INPUT-FORMAT= SQL *FORMAT= XML **********
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
19 9
Configuring Data Quality with Oracle Data Quality Servers Sample SiebelDQ.sdf File
*If Loading directly from Table, set INPUT-FORMAT as SQL, If loading from xml file use INPUT-FORMAT as XML ********* * logical-file-definition *====================== NAME= lf-input-contact PHYSICAL-FILE= IDT_CONTACT INPUT-FORMAT= SQL * logical-file-definition *====================== NAME= lf-input-prospect PHYSICAL-FILE= IDT_PROSPECT INPUT-FORMAT= SQL * user-job-definition *================== COMMENT= "Load Accounts" NAME= AccountLoad * user-step-definition *=================== COMMENT= "Step 0 for acct load" JOB= AccountLoad NUMBER= 0 NAME= runAccountLoad TYPE= "Load ID Table" PARAMETERS= ("Loader Definition",siebel_account) * user-job-definition *================== COMMENT= "Load contacts" NAME= ContactLoad * user-step-definition *=================== COMMENT= "Load Contacts" JOB= ContactLoad NUMBER= 0 NAME= runContactLoad TYPE= "Load ID Table" PARAMETERS= ("Loader Definition",siebel_contact) * user-job-definition *================== COMMENT= "Load Prospects" NAME= ProspectLoad * user-step-definition *=================== COMMENT= "Step 0 for prospect load" JOB= ProspectLoad NUMBER= 0
200
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample SiebelDQ.sdf File
NAME= runProspectLoad TYPE= "Load ID Table" PARAMETERS= ("Loader Definition",siebel_prospect) * search-definition *================ NAME= "search-person-name" IDX= IDX_CONTACT_NAME COMMENT= "Use this to search and score on person" KEY-LOGIC= SSA, System(default), Population(usa), Controls("FIELD=Person_Name SEARCH_LEVEL=Typical"), Field(Name) SCORE-LOGIC= SSA, System(default), Population(usa), Controls("Purpose=Person_Name MATCH_LEVEL=Typical"), MatchingFields("Name:Person_Name,StreetAddress:Address_Part1,City:Address_part2,State:Attribut e1,PrimaryPostalCode:Postal_area") * ********** * Depending on the Business requirement, you can add or remove the fields to be used for matching from the "Matching-Fields" section *********
search-definition *================ NAME= "search-address" IDX= IDX_CONTACT_ADDR COMMENT= "Use this to search and score on person" KEY-LOGIC= SSA, System(default), Population(usa), Controls("FIELD=Address_part1 SEARCH_LEVEL=Typical"), Field(StreetAddress) SCORE-LOGIC= SSA, System(default), Population(usa), Controls("Purpose=Address MATCH_LEVEL=Typical"), Matching-Fields ("Name:Person_Name,StreetAddress:Address_Part1,City:Address_part2,State:Attribute1,Pri maryPostalCode:Postal_area") * search-definition *================ NAME= "search-company" IDX= IDX_CONTACT_ORG COMMENT= "Use this to search for a person within a company" KEY-LOGIC= SSA, System(default), Population(usa),
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
20 1
Configuring Data Quality with Oracle Data Quality Servers Sample SiebelDQ.sdf File
Controls("FIELD=Organization_Name SEARCH_LEVEL=Typical"), Field(Account) SCORE-LOGIC= SSA, System(default), Population(usa), Controls("Purpose=Contact MATCH_LEVEL=Typical"), Matching-Fields ("Account:Organization_Name,Name:Person_Name,StreetAddress:Address_Part1") * search-definition *================ NAME= "search-prospect-name" IDX= IDX_PROSPECT COMMENT= "Use this to search and score on prospect person" KEY-LOGIC= SSA, System(default), Population(usa), Controls("FIELD=Person_Name SEARCH_LEVEL=Typical"), Field(Name) SCORE-LOGIC= SSA, System(default), Population(usa), Controls("Purpose=Person_Name MATCH_LEVEL=Typical"), MatchingFields("Name:Person_Name,StreetAddress:Address_Part1,City:Address_Part2,State:Attribut e1,PostalCode:Postal_Area") * search-definition *================ NAME= "search-org" IDX= IDX_ACCOUNT COMMENT= "Use this to search and score on company" KEY-LOGIC= SSA, System(default), Population(usa), Controls("FIELD=Organization_Name SEARCH_LEVEL=Typical"), Field(Name) SCORE-LOGIC= SSA, System(default), Population(usa), Controls("Purpose=Organization MATCH_LEVEL=Typical"), Matching-Fields ("Name:Organization_Name,PAccountStrAddress:Address_Part1,PAccountCity:Address_Part2") * multi-search-definition *====================== NAME= "multi-search-direct-contact" SEARCH-LIST= "search-person-name,search-company,search-address" IDT-NAME= IDT_CONTACT * multi-search-definition *====================== NAME= "multi-search-contact" SEARCH-LIST= "search-person-name,search-company"
202
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Configuring Data Quality with Oracle Data Quality Servers Sample SiebelDQ.sdf File
IDT-NAME= IDT_CONTACT * multi-search-definition *====================== NAME= "multi-search-person" SEARCH-LIST= "search-person-name,search-address" IDT-NAME= IDT_CONTACT * multi-search-definition *====================== NAME= "multi-search-division" SEARCH-LIST= "search-company,search-address" IDT-NAME= IDT_CONTACT * Section: User-Source-Tables * ********************************************************************* * Initial Load Database Source Views ********************************************************************** ************************************** * Staging Table for Account Data * Please refer the DQ Admin guide before changing the sequence of the fields ************************************** create_idt IDT_ACCOUNT sourced_from odb:15:ssa_src/ssa_src@ISS_DSN INIT_LOAD_ALL_ACCOUNTS.ACCOUNT_NAME Name V(100), INIT_LOAD_ALL_ACCOUNTS.ACCOUNT_ADDR_ID DUNSNumber V(60), INIT_LOAD_ALL_ACCOUNTS.ACCOUNT_ID (pk1) RowId C(30) , INIT_LOAD_ALL_ACCOUNTS.CITY PAccountCity V(100), INIT_LOAD_ALL_ACCOUNTS.COUNTRY PAccountCountry V(60), INIT_LOAD_ALL_ACCOUNTS.POSTAL_CODE PAccountPostalCode V(60), INIT_LOAD_ALL_ACCOUNTS.STATE PAccountState V(20), INIT_LOAD_ALL_ACCOUNTS.ADDRESS_LINE1 PAccountStrAddress V(100), INIT_LOAD_ALL_ACCOUNTS.ACCOUNT_ADDR_ID (pk2)PAccountAddressIDC(60) SYNC REPLACE_DUPLICATE_PK TXN-SOURCE NSA ; ********************************************************************** * Sample entries if Loading the data from Flat File ********************************************************************** *create_idt * IDT_ACCOUNT * sourced_from FLAT_FILE * NameW(100), * DUNSNumberW(60), * PAccountCityW(100), * PAccountCountryW(60), * PAccountPostalCodeW(60), * PAccountStateW(20), * PAccountStrAddressW(100), * (pk)RowIdC(30) *
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
20 3
Configuring Data Quality with Oracle Data Quality Servers Sample SiebelDQ.sdf File
*SYNC REPLACE_DUPLICATE_PK *TXN-SOURCE NSA *; ************************************** * Staging Table for Contact Data ************************************** create_idt IDT_CONTACT sourced_from odb:15:ssa_src/ssa_src@ISS_DSN INIT_LOAD_ALL_CONTACTS.BIRTHDATE BirthDate V(60), INIT_LOAD_ALL_CONTACTS.CELLULARPHONE CellularPhone V(60), INIT_LOAD_ALL_CONTACTS.EMAILADDRESS EmailAddress V(60), INIT_LOAD_ALL_CONTACTS.NAME NAME V(100), INIT_LOAD_ALL_CONTACTS.HOMEPHONE HomePhone V(60), INIT_LOAD_ALL_CONTACTS.MIDDLE_NAME MiddleName V(100), INIT_LOAD_ALL_CONTACTS.ACCOUNT Account V(100), INIT_LOAD_ALL_CONTACTS.CONTACT_ID (pk1) RowId C(30), INIT_LOAD_ALL_CONTACTS.SOCIALSECURITYNUMBER SocialSecurityNumber INIT_LOAD_ALL_CONTACTS.WORKPHONE WorkPhone V(60) , INIT_LOAD_ALL_CONTACTS.CITY City V(60), INIT_LOAD_ALL_CONTACTS.COUNTRY Country V(20), INIT_LOAD_ALL_CONTACTS.POSTAL_CODE PrimaryPostalCode V(20), INIT_LOAD_ALL_CONTACTS.STATE State V(20), INIT_LOAD_ALL_CONTACTS.STREETADDRESS StreetAddress V(100), INIT_LOAD_ALL_CONTACTS.ADDRESS_ID (pk2)ContactAddressIDC(60) SYNC REPLACE_DUPLICATE_PK TXN-SOURCE NSA ; ************************************** * Staging Table for Prospect Data **************************************
V(60),
create_idt IDT_PROSPECT sourced_from odb:15:ssa_src/ssa_src@ISS_DSN INIT_LOAD_ALL_PROSPECTS.ACCOUNT_NAME Account V(100), INIT_LOAD_ALL_PROSPECTS.CELLULAR_PHONE CellularPhone V(60), INIT_LOAD_ALL_PROSPECTS.CITY City V(60), INIT_LOAD_ALL_PROSPECTS.COUNTRY Country V(30), INIT_LOAD_ALL_PROSPECTS.EMAIL_ADDRESS EmailAddress V(60), INIT_LOAD_ALL_PROSPECTS.NAME NAME V(100), INIT_LOAD_ALL_PROSPECTS.HOME_PHONE HomePhone V(60), INIT_LOAD_ALL_PROSPECTS.MIDDLE_NAME MiddleName V(100), INIT_LOAD_ALL_PROSPECTS.POSTAL_CODE PostalCode V(20), INIT_LOAD_ALL_PROSPECTS.SOCIAL_SECURITY_NUMBER SocialSecurityNumber INIT_LOAD_ALL_PROSPECTS.STATE State V(20), INIT_LOAD_ALL_PROSPECTS.STREETADDRESS StreetAddress V(100), INIT_LOAD_ALL_PROSPECTS.WORK_PHONE WorkPhone V(100), INIT_LOAD_ALL_PROSPECTS.PROSPECT_ID (pk) RowId C(30) SYNC REPLACE_DUPLICATE_PK TXN-SOURCE NSA ; Section: Files Section: Views
V(60),
204
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
This chapter explains how to administer data quality in order to perform your data matching and data cleansing tasks. It includes the following topics: Data Quality Modes of Operation on page 206
Data Quality Rules on page 208 Data Quality Batch Job Parameters on page 210 Cleansing Data Using Batch Jobs on page 212 Matching Data Using Batch Jobs on page 213 Merge Algorithm in the Object Manager Layer on page 217 Merging of Duplicate Records on page 219 Process of Merging Duplicate Records on page 220 Using Fuzzy Query on page 222 Calling Data Matching and Data Cleansing from Scripts or Workflows on page 224 Troubleshooting Data Quality on page 231 About the Universal Connector API on page 232 About Universal Connector API Terminology on page 232 Vendor Libraries on page 233 Connector Initialization and Termination Functions on page 233 Session Initialization and Termination Functions on page 234 Parameter Setting Functions on page 235 External Key Generation Function on page 237 Error Message Function on page 238 Real-Time Data Matching Functions on page 239 Batch Mode Data Matching Functions on page 242 Real-Time Data Cleansing Function on page 247 Batch Mode Data Cleansing Function on page 248 Data Matching and Data Cleansing Algorithms on page 248 Viewing Parameter and Field Mapping Values for Universal Connector on page 250
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
20 5
In batch mode, you can use either the Administration - Server Management screen or the srvrmgr command-line utility to submit server component batch jobs. You run these batch jobs at intervals depending on business requirements and the amount of new and changed records. For data cleansing, a batch run standardizes and corrects a number of account, contact, prospect, or business address fields. You can cleanse all of the records for a business component or a subset of records. For more information about data cleansing batch tasks, see Cleansing Data Using Batch Jobs on page 212. For data matching, a batch run identifies potential duplicate record matches for account, contact, and prospect records. You can perform data matching for all of the records for a business component, or a subset of records. Potential duplicate records are presented to the data administrator for resolution in the Administration-Data Quality views. The duplicates can be resolved over time by a data steward (a person whose job is to monitor the quality of incoming and outgoing data for an organization.) For more information about data matching batch tasks, see Matching Data Using Batch Jobs on page 213.
206
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
You must then decide the fate of the new record, as follows: If you think the record is not a duplicate, close the dialog box or click Ignore All. The new record remains saved in the database and no change takes place. If you think the record is a duplicate, select the best-matching record from the dialog box using the Pick button. The duplicate record that you choose becomes the surviving record and the new record gets deleted after a sequenced merge with the surviving record as described in Sequenced Merges on page 219. In real-time mode, if you enter two new records that have the same Name and Location, then an error message displays similar to the following: The same values for (Name, Location) already exist. To enter a new record, make sure that field values are unique. Real-time data matching prevents creation of a duplicate record in the following ways: If you are in the process of creating a new record, that record is not saved. If you are in the process of modifying a record, the change is not made to the record.
NOTE: Only certain fields are configured to support data matching and data cleansing. If you do not enter values in these fields when you create a new record, or you do not modify the values in these fields when changing a record, data cleansing and data matching are not triggered. For more information about which fields are preconfigured for different business components, see Preconfigured Field Mappings for Oracle Data Quality Matching Server on page 112 and Preconfigured Field Mappings for Oracle Data Quality Matching Server on page 112.
You can specify a data quality rule in the batch job parameters. This is a convenient way of consolidating and reusing batch job parameters and also of overriding vendor parameters. For more information, see Data Quality Rules on page 208. For more information about using the Siebel Server Manager and administering component jobs, see Siebel System Administration Guide. In particular, read the chapters about the Siebel Enterprise Server architecture, using the Siebel Server Manager GUI, and using the Siebel Server Manager command-line interface.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
20 7
You must run batch mode key generation on all existing records before you run real-time data matching. The Universal Connector requires generated keys in the key tables first before you can run real-time data matching. The key generation is done within the deduplication task (which is the reason for running deduplication on all existing records first). CAUTION: If you write custom Siebel CRM scripting on business components used for data matching (such as Account, Contact, List Mgmt Prospective Contact, and so on), the modifications to the fields by the script execute in the background and might not trigger logic that activates user interface features. For example, the scripting might not trigger UI features such as windows that show potential matching records.
208
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Enter a value between 50 and 100. This value overrides the value in the Data Quality settings. Applicable for Operation Name DeDuplication only.
Select a business component name. Select the business object name corresponding to the business components
An example of a rule is shown in the following table. This is a rule for DeDuplication operations for all Account records whose name starts with Aa. Field Name Search Specification Vendor Name Operation Type Operation Name Threshold Source Business Component Source Business Object Value Rule_Batch_Account_Dedup [Name] LIKE 'Aa*' ISS Batch DeDuplication 60 Account Account
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
20 9
a b
Click the Rule Parameter view tab. Create rule parameters by selecting a parameter and entering the required value.
Table 79.
Data Quality Batch Job Parameters Required Yes Description The name of the business component: Possible values include: Account Contact List Mgmt Prospective Contact Business Address - applicable to Data Cleansing operations only. For Siebel Industry Applications, CUT Address is used instead of Business Address.
Yes
The name of the business object. Possible values include: Account Contact List Mgmt Business Address - applicable to Data Cleansing operations only
Yes
The type of operation: Possible values are: Data Cleansing - cleanses data Key Generate - generates match keys Key Refresh - refreshes match keys DeDuplication - performs data matching.
No
Applicable to Data Matching operations only. Indicates how candidate records are sorted for optimal processing by the data matching software. The default value is Dedup Token.
210
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Table 79.
Data Quality Batch Job Parameters Required No Description Limits the number of records processed by a data quality task. Typically, you use the account's name or the contact's first name to split up large data quality batch tasks using the first letter of the name. For example, the following object WHERE clause selects only French account records where the account name begins with A: [Name] like 'A*' AND [Country] = 'France' As another example, the following object WHERE clause selects all records where Name begins with Paris or ends with london: [Name] like 'Paris*' or [Name] like '*london'
No
Specifies data quality settings for data cleansing and data matching jobs. This parameter has three values separated by commas: First value. If this value is set to Delete, existing duplicates are deleted. Otherwise, existing duplicates are not deleted. This is the only usage for this value. Second value. Applicable to the Universal Connector only. It specifies whether the job is a full or incremental data matching job. Third value. This is obsolete. Enter an empty string.
For more information about the use of DQSetting, see Matching Data Using Batch Jobs on page 213. Threshold Rule Name No No Specifies a value for the Threshold data quality parameter. Specifies the name of a data quality rule. A rule with the specified name must have been created in the Administration - Data Quality screen, Rules view. For example: RuleName="Rule_Batch_Account_Dedup" For more information, see Data Quality Rules on page 208.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
21 1
Business Address
Contact
212
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Contact
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
21 3
Example of Server Manager Command run task for comp DQMgr with DqSetting="'Delete'", bcname="List Mgmt Prospective Contact", bobjname="List Mgmt", opType=DeDuplication, objwhereclause="[Name] like 'search_string*'"
A typical example of a command for a full data matching job is as follows: run task for comp DQMgr with DqSetting="'','Yes','account_match.xml'", bcname=Account, bobjname=Account, opType=DeDuplication, objwhereclause="[Name] LIKE 'A*'" Jobs like this that perform data matching for a subset of records are still considered to be full data matching jobs because the data to be checked does not depend on earlier data matching.
Table 80.
DqSetting Parameter Details and Sample Values Valid Values Leave blank Yes or No (default) Comments Specify as two adjacent quotation marks. Specifies whether or not the same search specification is used for both the records whose duplicates are of interest and the candidate records that can include those duplicates. Use Yes for full data matching batch jobs. Use No for incremental data matching batch jobs. None.
DqSetting Parameter Sequence First section Second section (Enforce Search Spec on Candidate Records)
Third section
Leave blank
214
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
This kind of job is considered an incremental data matching job, because data matching was done earlier and does not need to be redone at this time. In an incremental data matching batch job, the records for which you want to locate duplicates are defined by the search specification, but the candidate records that can include those duplicates can be drawn from the whole applicable database table. Incremental data matching batch jobs are useful if you run them regularly, such as once a week. A typical example of a command for an incremental data matching job is as follows: run task for comp DQMgr with DqSetting="'','No',''", bcname=Account, bobjname=Account, opType=DeDuplication, objwhereclause="[Updated] > '08/18/2005 20:00:00' NOTE: If you do not specify the DQSetting parameter, or leave the second value of the DQSetting parameter blank, the job will be an incremental data matching job.
Example of Server Manager Command run task for comp DQMgr with bcname=Account, bobjname=Account, opType="Key Generate", objwhereclause="[Updated] > '07/18/2005 16:00:00'" run task for comp DQMgr with bcname=Account, bobjname=Account, opType="Key Refresh", objwhereclause="[Name] LIKE 'search_string*'" run task for comp DQMgr with bcname=Contact, bobjname=Contact, opType="Key Generate", objwhereclause="[Updated] > '07/01/2005 14:10:00'" run task for comp DQMgr with bcname=Contact, bobjname=Contact, opType="Key Refresh", objwhereclause="[Last Name] LIKE 'search_string*'"
Account
Refresh
Contact
Generate
Contact
Refresh
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
21 5
Example of Server Manager Command run task for comp DQMgr with bcname="List Mgmt Prospective Contact", bobjname="List Mgmt", opType="Key Generate", objwhereclause="[Updated] > '07/18/2005 16:00:00'" run task for comp DQMgr with bcname="List Mgmt Prospective Contact", bobjname="List Mgmt", opType="Key Refresh", objwhereclause="[Last Name] LIKE 'search_string*'"
Refresh
The examples in the table show slightly different WHERE clauses for key generation and key refresh operations, as follows:
The generation commands generate keys for all records in the business component that have been updated since the specified date and time. The refresh commands refresh keys for all records in the business component that match the search string in the specified field.
You can use either of these two types of WHERE clauses for both generation and refresh operations. If you want to generate or refresh keys for all records in the business component, use a WHERE clause containing a wildcard character (*) to match all records, as follows: objwhereclause="[field_name] LIKE '*'"
You must enable new custom Data Quality Manager components before you can use them. And, if you change parameters of running components, you must shut down and restart the components or restart the Siebel Server for the changes to take effect.
216
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
NOTE: For Siebel CRM Version 7.8 and later, you can also set specific parameters for a data quality task and save the configuration as a template by using the Administration - Server Configuration screen, Job Templates view. The benefit in doing so is that there is no need to copy component definitions. For more information about Siebel CRM templates, see Configuring Siebel Business Applications.
The links defined between the business components are used to implement the merge algorithm. The algorithm used by the merge process at the OM layer is explained in the following topics for one-tomany and many-to-many links.
One-to-Many Relationship
A one-to-many relationship defines the destination field, which is the foreign key in the detail table that points to a row in the parent table. Only links where the source field is "Id", that is, where the foreign key in the detail table stores the ROW_ID of the parent table row, are considered. To make children of A2 point to A1, the merge must update the destination field in the detail table to now point to the ROW_ID of A1.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
21 7
User property name: Use Literals for Merge Use Literals For Merge: S_BU Value: TRUE When merging two records, the child records of the loser record point to the survivor record and the LAST_UPD and LAST_UPD_By columns of those child records are also updated. For example, account A2 is merged to account A1. Account A2 has service request SR1, and SR2. The columns LAST_UPD, and LAST_UPD_BY of SR1 and SR2 are updated during merge process. From the example, link account or quote foreign key in S_DOC_Quote is account Id (TARGET_OU_ID). TARGET_OU_ID stored the ROW_ID of the A2. It is now updated to point to ROW_ID of A1. SQL generated: UPDATE S_DOC_QUOTE set TARGET_OU_ID = 'Row Id of A1' where: TARGET_OU_ID is equal to 'Row Id of A2' While the merge is processing the link account or quote, it also checks to see if there are other foreign keys from quote pointing to account using the join definitions. These keys are also updated. An optimization is used to ensure that there are no redundant update statements. For example, if there are two links defined (account or quote and account or quote with primary with the same destination field Account Id), the process would update TARGET_OU_ID of S_DOC_QUOTE twice to point to A1. To avoid this scenario, a map of table name or column name of the processed field is maintained. The update is skipped if the column has been processed before. After the update you might have duplicate children for an account. For example, if the unique key for a quote is the name of the quote, merging two accounts with quotation marks of the same name will result in duplicates. The CONFLICT_ID column of children that will become duplicates after the merge is updated. This operation is performed before the actual update. The user must examine duplicate children (identified by CONFLICT_ID being set) to make sure that they are true duplicates. For example, if the merged account has child quotation marks named Q1 and Q1, it is possible that these refer to distinct quotation marks. If this is the case, the name of one of the quotation marks must be updated and the children must be merged.
Many-to-Many Relationship
The many-to-many relationship (Accounts-Contacts) differs slightly from the one-to-many relationship in that it is implemented using an intersection table that stores the ROW_IDs of parentchild records. On a merge, the associations must be updated. The Contacts associated with the old Account is now associated with the new Account. The Inter parent column of the intersection table is updated to point to the new parent. As in the one-to-many case, to avoid redundant updates, a map of intersection tables that have been processed is maintained. Therefore, if the source and target business components use the same base table, both child and parent columns are updated. The CONFLICT_ID column of intersection table entries that become duplicates after the merge is updated.
218
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
In contrast to the one-to-many link case, duplicates in the intersection table imply that the same child is being associated with the parent two or more times. However, there might be cases where the intersection table has entries besides the ROW_ID of the parent and child rows that store information specific to the association. The duplicate association records are only preserved when records are determined as unique (according to the intersection table unique key). This means those duplicate association records might have some unique attributes and these attributes are part of a unique key of the intersection table. CONFLICT_ID does not account for uniqueness among records.
Sequenced Merges
You use a sequenced merge to merge multiple records into one record. You assign sequence numbers to the records so that the record with the lowest sequence number becomes the surviving record, and the other records, the nonsurviving records, are merged with the surviving record. When records are merged using a sequence merge, the following rules apply: All non-NULL fields from the surviving record are kept. Any fields that were NULL in the surviving record are populated by information (if any) from the nonsurviving records. Missing fields in the surviving record are populated in ascending sequence number order from corresponding fields in the nonsurviving records. The children and grandchildren (for example, activities, orders, assets, service requests, and so on) of the nonsurviving records are merged by associating them to the surviving record.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
21 9
Sequenced merge is especially useful if many fields are empty, such as when a contact record with a Sequence of 2 has a value for Email address, but its Work Phone # field is empty, and a contact record with a Sequence number of 3 has a value of Work Phone #. If the field Email address and Work Phone # in the surviving record (sequence number 1) are empty, the value of Email address is taken from the records with sequence number 2, and the value of Work Phone # is taken from the record of sequence number 3. A sequence number is required for each record even if there are only two records.
Filtering Duplicate Records on page 220 This involves creating a query to find a subset of the duplicate records and then review the query results. For example, you might want to create a query that includes a subset of all duplicate records where the Name field starts with the letter A.
Merging Duplicate Records on page 221 After the query results appear, you merge duplicate records using either the Merge button or the Merge Records option.
CAUTION: You must perform batch data matching first before trying to resolve duplicate records. For more information about batch data matching, see Batch Data Cleansing and Data Matching on page 207.
220
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Click Query, enter your search criteria, and then click Go. The search results appear.
You now decide what you want to do with the duplicate records.
In the Duplicate view, drill down on one of the duplicate records. The appropriate Duplicate XXX Resolution view appears. The child applet shows the list of duplicate rows with the parent record appearing as the first row.
3 4
If two or more records appear to be duplicates, enter a sequence number in the Sequence field for each record. Edit the records, if necessary. For example, you might want to keep some values from fields in nonsurviving records. In this case, you can make fields NULL in what will be the surviving records. The values from the corresponding fields in the nonsurviving records are then used to populate the NULL fields after the sequenced merge.
5 6
Select the records to be merged. Click Merge. The records are merged to produce one new record. The record with the lowest sequence number assigned is retained after the merge. Missing fields in the retained record are populated from corresponding fields in the nonsurviving records, as described in Sequenced Merges on page 219.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
22 1
If the conditions for fuzzy query are not satisfied, then any queries you make use standard query functionality.
222
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
3 4
Navigate to the Accounts screen, then the Account list view. Enter your query, and then click Go. For this example, in the Name field, enter Symphony. Up to 10 records having Name set to Symphony are displayed.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
22 3
Administering Data Quality Calling Data Matching and Data Cleansing from Scripts or Workflows
NOTE: If the number of Symphony account records is fewer than 10, then the fuzzy query results includes records where symphony is lowercase (as well as uppercase). For example, if four records for Symphony and 100 records for symphony are found in the database, the fuzzy query result shows four Symphony records and six symphony records. However, if fuzzy query is disabled, only the four Symphony records appear.
You can also call data quality from external callers to perform data cleansing. There are preconfigured Data Cleansing business service methodsGet Siebel Fields and Parse. Using an external caller, such as scripting or a workflow process, you first call the Get Siebel Fields method, and then call the Parse method to cleanse contacts and accounts. The following scenarios provide more information about calling data quality from external callers: Scenario for Data Matching Using the Value Match Method on page 224 Scenario for Data Cleansing Using Data Cleansing Business Service Methods on page 225
224
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Administering Data Quality Calling Data Matching and Data Cleansing from Scripts or Workflows
Scenario for Data Cleansing Using Data Cleansing Business Service Methods
This topic gives one example of how you can call the Data Cleansing business service methods using Siebel Workflow. You might use the methods differently, depending on your business mode. A system administrator or data steward in an enterprise wants to cleanse data before it enters the data through EAI or EIM interfaces. To do this, the system administrator or data steward uses a script or workflow that cleanses the data. The script or workflow calls the Get Siebel Fields method, which returns a list of cleansed fields for the applicable business component. Then the script or workflow calls the Parse method, which returns the data for the cleansed fields. For information about how to call and use the Get Siebel Fields and Parse methods, see Data Cleansing Business Service Methods on page 230.
Arguments The Value Match method consists of input and output arguments, some of which are property sets. Table 81 describes the input arguments, and Table 82 on page 227 describes the output arguments. CAUTION: The Value Match method arguments are specialized. Do not configure these components.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
22 5
Administering Data Quality Calling Data Matching and Data Cleansing from Scripts or Workflows
Value Match Method Input Arguments Type Property Set Property Name Threshold Description The threshold score for a duplicate record. A match is considered only if the score exceeds this value. Comments Optional. The value Override can be specified to override the corresponding setting information obtained by the service from the administration screens, vendor properties, and so on. These name-value pairs are used as the matched value rather than the current row ID of the matched business component. The vendor field mappings for the matched business component are used to map the business component field names to vendor field names.
Match Values
Property Set
The matched business component's field name and the corresponding field value:
(Last Name, 'Smith') (First Name, 'John'), and so on ...
NOTE: Each pair must be a child property set of Match Values. BC Name Property BC Name The name of the matched business component. If set to N, the match modification date is not updated. If set to N, matches are not added to the result table. Instead, matches are determined by the business service.
Required.
Property
Property
226
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Administering Data Quality Calling Data Matching and Data Cleansing from Scripts or Workflows
Value Match Method Input Arguments Type Property Property Name Support Multiple Child Entities Dedup Description If the input argument is True, the data in the Match Values input argument is in a new data format. NOTE: To support the Multiple Child Entities Dedup feature, vendors must support hierarchical data format. Comments Required.
MultipleChild EntitiesDedup
NOTE: Adapter Settings and Match Values are child property sets of the input property set. Return Value For each match, a separate child property set called Match Info is returned in the output with properties specific to the match (such as Matchee Row ID and Score), as well as some general output parameters as shown in Table 82. CAUTION: The Value Match method arguments are specialized. Do not configure these components.
Value Match Method Output Arguments Type Property Property Set Property Name End Time Matchee Row ID Score Description The run end time. The row ID of a matching record. The score of a matching record. Comments None If you match against existing records, the record ROW_IDs are found and returned in the Match Info property set. None None
Match Info NOTE: Match Info is a child property set of the output property set. Num Results Start Time
Property Property
Called From Any means by which you can call business service methods, such as with Siebel eScript or from a workflow process.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
22 7
Administering Data Quality Calling Data Matching and Data Cleansing from Scripts or Workflows
Example The following is an example of using Siebel eScript to call the Value Match method. This script calls the Value Match method to look for duplicates of John Smith from the Contact business component and then returns matches, if any. After the script finishes, determine what you want to do with the duplicate records, that is, either merge or remove them.
228
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Administering Data Quality Calling Data Matching and Data Cleansing from Scripts or Workflows
function Script_Open () { TheApplication().TraceOff(); TheApplication().TraceOn("sdq.log", "Allocation", "All"); TheApplication().Trace("Start of Trace"); // Create the Input property set and a placeholder for the Output property set var svcs; var sInput, sOutput, sAdapter, sMatchValues; var buscomp; svcs = TheApplication().GetService("DeDuplication"); sInput = TheApplication().NewPropertySet(); sOutput = TheApplication().NewPropertySet(); sAdapter = TheApplication().NewPropertySet(); sMatchValues = TheApplication().NewPropertySet(); // Set Generic Settings input property parameters sInput.SetProperty("BC Name", "Contact"); sInput.SetProperty("Use Result Table", "N"); sInput.SetType("Generic Settings"); // Set Match Values child input property parameters sMatchValues.SetProperty("Last Name", "Smith"); sMatchValues.SetProperty("First Name", "John"); sMatchValues.SetType("Match Values"); sInput.AddChild(sMatchValues); // Set Adapter Settings child input property parameters sAdapter.SetProperty("Search Level", "Narrow"); sAdapter.SetProperty("Population", "Default"); sAdapter.SetType("Adapter Settings"); sInput.AddChild(sAdapter); // Invoke the "Value Match" business service TheApplication().Trace("Property set created, ready to call Match method"); svcs.InvokeMethod("Value Match", sInput, sOutput); // Get the Output property set and its values TheApplication().Trace("Value Match method invoked"); var propName = ""; var propVal = ""; propName = sOutput.GetFirstProperty(); while (propName != "") { propVal = sOutput.GetProperty(propName); TheApplication().Trace(propName); TheApplication().Trace(propVal); propName = sOutput.GetNextProperty() } TheApplication().Trace("End Of Trace"); TheApplication().TraceOff(); }
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
22 9
Administering Data Quality Calling Data Matching and Data Cleansing from Scripts or Workflows
Table 83.
Get Siebel Fields Arguments Display Name Bus Comp Name Field Names Input or Output Input Output Data Type String Hierarchy Description The name of the business component. The name of the hierarchy. Required? No Yes
Return Value Child values: Name of the properties are Field 1, Field 2, and so on and corresponding values are Field Name.
Usage This method is used with the Parse method in the process of cleansing data in real time, and it is used with the Parse All function in the process of using a batch job to cleanse data.
Called From Any means by which you can call business service methods, such as with Siebel Workflow or Siebel eScript.
Parse Method
Parse is one of the methods of the Data Cleansing business service. This method returns the cleansed field data. For more information about business services and methods, see Siebel Developers Reference.
230
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Table 84.
Parse Arguments Display Name Bus Comp Name Input or Output Input Data Type String Description The name of the business component. A list of field values. A list of field values. Required? No
Input Output
Hierarchy Hierarchy
Yes Yes
Return Value Child name values are Field Name and Field Date.
Usage This method is used following the Get Siebel Fields method in the process of cleansing data in real time.
Called From Any means by which you can call business service methods, such as with Siebel Workflow or Siebel eScript. For more information about Siebel Workflow, see Siebel Business Process Framework: Workflow Guide.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
23 1
If you have configured new business components for data cleansing or data matching, also check the following: Business component Class property. Verify that the business component Class property is CSSBCBase. Vendor Properties. Verify that the vendor parameters and vendor field mappings have the correct values and that the values are formatted correctly. For example, there must be a space after a comma in vendor properties that have a compound value.
TIP: Check My Oracle Support regularly for updates to troubleshooting and other important information. For more information, see Information about Data Quality on My Oracle Support on page 259.
232
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Vendor Libraries
Vendors must follow these rules for their DLLs or shared libraries: The libraries must be thread-safe. A library can support multiple sessions by using different unique session IDs. The libraries must support UTF-16 (UCS2) as the default Unicode encoding. If there is a single library for all supported languages, the libraries must be named as follows:
BASE.dll (on Windows) libBASE.so (on AIX and Oracle Solaris) libBASE.sl (on HP-UX)
where BASE is a name chosen by the vendor. If a vendor has many solutions for different types of data, then the vendor can use different base names for different libraries. If there are separate libraries for different languages, the library name must include the appropriate language code. For example, for Japanese (JPN), the libraries must be named as follows:
BASEjpn.dll (on Windows) libBASEjpn.so (on AIX and Oracle Solaris) libBASEjpn.sl (on HP-UX)
The Siebel application loads the libraries from the locations described in Table 4 on page 33. The mapping of Siebel application field names to vendor field names is stored as values of the relevant Business Component user properties in the Siebel repository. Storage of these field values is mandatory. Any other vendor-specific parameter required (for example, port number) for the vendors library must be stored outside of Siebel CRM.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
23 3
sdq_init_connector Function
This function is called using the absolute installation path of the SDQConnector directory (./ siebsrvr/SDQConnector) when the vendor library is first loaded to facilitate any initialization tasks. It can be used by the vendor to read any configuration files it might choose to use. Syntax Parameters int sdq_init_connector (const SSchar * path) path: The absolute path of the Siebel Server installation. Vendors can use this path to locate any required parameter file for loading the necessary parameters (like port number and so on). This is a Unicode string because the Siebel Server can be installed for languages other than English. A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
Return Value
sdq_shutdown_connector Function
This function is called when the Siebel Server is shutting down to perform any necessary cleanup tasks. Syntax Parameters Return Value int sdq_shutdown_connector (void) This function does not have any parameters. A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
234
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
sdq_init_session Function
This function is called when the current session is initialized. This allows the vendor to initialize the parameters of a session or perform any other initialization tasks required. Syntax Parameters int sdq_init_session (int * session_id) session_id: A unique value provided by the vendor that is used in function calls while the session is active. The value 0 is reserved as an invalid session ID. The Siebel CRM code calls this function with a session ID of 0, so the session ID must be initialized to a nonzero value. A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
Return Value
sdq_close_session Function
This function is called when a particular data cleansing or data matching operation is finished and it is required to close the session. Any necessary cleanup tasks are performed. Syntax Parameters Return Value int sdq_close_session (int * session_id) session_id: The session ID obtained by initializing the session. A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
sdq_set_global_parameter Function
This function is called to set global parameters. This function call is made after the call to sdq_init_connector. The vendor must put the configuration file, if using one, in ./siebsrvr/SDQConnectorpath. When the vendor DLL is loaded, it calls the sdq_init_connector API function (if it is exposed by the vendor) with the absolute path to the SDQConnector directory. It is then up to the vendor to read the appropriate configuration file. The configuration file name is dependent on vendor specifications. An XML character string is used to specify the parameters. This provides an extensible way of providing parameters with each function call.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
23 5
Using the sdq_set_global_parameter API, any global parameters specific to the vendor can be put as a user property to DeDuplication business service, where the format of the business service user property is as follows: "Global", "Parameter Name", "Parameter Value" These global parameters are set to the vendor only after the vendor DLL loads. You can define user properties for the DeDuplication business service as follows: Property: My Connector 1 Value: MyDQMatch Property: MyDQMatch Parameter 1 Value: "Global", "zGlobalParam1", "zGlobalParam1Val" Syntax Parameters int sdq_set_global_parameter (const SSchar* parameterList) parameterList: An XML character string that contains the list of parameters and values specific to this function call. An example of the XML is as follows: <Data> <Parameter> <GlobalParam1>GlobalParam1Val</GlobalParam1> </Parameter> </Data> Return Value A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
sdq_set_parameter Function
This function is called, after the call to sdq_init_session, to set parameters that are applicable at the session context. The vendor must put the configuration file, if using one, in ./siebsrvr/SDQConnectorpath. When the vendor DLL is loaded, it calls the sdq_init_connector API function (if it is exposed by the vendor) with the absolute path to the SDQConnector directory. It is then up to vendor to read the appropriate configuration file. The configuration file name is dependent on vendor specifications. Using the sdq_set_parameter API, any session parameters specific to the vendor can be put as a user property to the DeDuplication business service, where the format of the business service user property is as follows: "Session", "Parameter Name", "Parameter Value"
236
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
These session parameters are set to the vendor, after each session opens with the vendor. You can define user properties for the DeDuplication business service as follows: Property: My Connector 1 Value: MyDQMatch Property: MyDQMatch Parameter 2 Value: "Session", "zSessParam2", "zSessParam2Val" Syntax Parameters int sdq_set_parameter (int session_id, const SSchar* parameterList) session_id: The session ID obtained while initializing the session. parameterList: An XML character string that contains the list of parameters and values that are specific to this function call. An example of the XML is as follows: <Data> <Parameter> <Name>RECORD_TYPE</Name> <Value>Contact</Value> </Parameter> <Parameter> <Name>SessionParam1</Name> <Value>SessionValue1</Value> </Parameter> </Data> Return Value A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
23 7
sdq_get_dedup_keys Function
This function allows you to invoke a third party vendor to generate the deduplication keys. NOTE: To support Multiple Child Entities Deduplication, you must use the data format shown in Data Format for Deduplication of Multiple Child Entities on page 85. Syntax int sdq_get_dedup_keys (int session_id, SSchar* parameterList, SSchar* inputRecordSet, SSchar* outputRecordSet) Parameters session_id: The session ID obtained by initializing the session. parameterList: An XML character string contains the list of parameters and values that are specific to this function call. An XML example follows: <Data> <Parameter> <Name>KeyGenerationParam1</Name> <Value>KeyGenerationValue1</Value> </Parameter> <Parameter> <Name>KeyGenerationParam2</Name> <Value>KeyGenerationValue2</Value> </Parameter> </Data> Return Value A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
sdq_get_error_message Function
This function is called if any of the Universal Connector functions return a code other than 0, which indicates an error. This function performs a message lookup and gets the summary and details for the error that just occurred for display to the user or writing to the log. Syntax Parameters void sdq_get_error_message (int error_code, SSchar * error_summary, SSchar * error_details) error_code: The error code returned from the previous function call.
238
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Return Value
error_summary: A pointer to the error message summary, which is up to 256 characters long. error_details: A pointer to the error message details, which are up to 1024 characters long.
sdq_dedup_realtime Function
This function is called to perform real-time data matching when match candidate acquisition takes place in Siebel CRM. NOTE: To support Multiple Child Entities Deduplication, you must use the data format shown in Data Format for Deduplication of Multiple Child Entities on page 85. This function sends the data for each record as driver records and their candidate records. The function is called only once; multiple calls to the vendor library are not made even when the set of potential candidate records is huge. As all the candidate records are sent at once, all the duplicates for a given record are returned. Syntax Parameters int sdq_dedup_realtime (int session_id, SSchar* parameterList, SSchar* inputRecordSet, SSchar* outputRecordSet) session_id: The session ID obtained by initializing the session.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
23 9
parameterList: An XML character string that contains the list of parameters and values that are specific to this function call. An XML example follows: <Data> <Parameter> <Name>RealTimeDedupParam1</Name> <Value>RealTimeDedupValue1</Value> </Parameter> <Parameter> <Name>RealTimeDedupParam2</Name> <Value>RealTimeDedupValue2</Value> </Parameter> </Data> NOTE: The parameterList parameter is set to NULL as all required parameters are already set at the session level.
inputRecordSet: An XML character string containing the driver record and candidate records. An XML example follows: <Data> <DriverRecord> <Account.Id>1-X42</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Headquarters</Account.Location> </DriverRecord> <CandidateRecord> <Account.Id>1-Y28</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Atlanta</Account.Location> </CandidateRecord> <CandidateRecord> <Account.Id>1-3-P</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Rome</Account.Location> </CandidateRecord> </Data>
240
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
outputRecordSet: An XML character string populated by the vendor in real time that contains the duplicate records with the scores. An XML example follows: <Data> <DuplicateRecord> <Account.Id>SAME ID AS DRIVER </Account.Id> <DQ.MatchScore></DQ.MatchScore> </DuplicateRecord> <DuplicateRecord> <Account.Id>1-Y28</Account.Id> <DQ.MatchScore>92</DQ.MatchScore> </DuplicateRecord> <DuplicateRecord> <Account.Id>1-3-P</Account.Id> <DQ.MatchScore>88</DQ.MatchScore> </DuplicateRecord> </Data>
Return Value
A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
sdq_dedup_realtime_nomemory Function
This function is called to perform real-time data matching when match candidate acquisition takes place in Oracle Data Quality Matching Server. Syntax Parameters int sdq_dedup_realtime_nomemory (int session_id, SSchar* parameterList, SSchar* inputRecordSet, SSchar* outputRecordSet) session_id: The session ID obtained by initializing the session. parameterList: An XML character string that contains the list of parameters and values that are specific to this function call. An XML example follows: <Data> <Parameter> <Name>RealTimeDedupParam1</Name> <Value>RealTimeDedupValue1</Value> </Parameter> <Parameter> <Name>RealTimeDedupParam2</Name> <Value>RealTimeDedupValue2</Value> </Parameter> </Data> NOTE: The parameterList parameter is set to NULL as all required parameters are already set at the session level.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
24 1
inputRecordSet: An XML character string containing the driver record. An XML example follows: <Data> <DriverRecord> <DUNSNumber>123456789</DUNSNumber> <Name>Siebel</Name> <<RowId>1-X40</RowId> </DriverRecord> </Data>
outputRecordSet: An XML character string populated by the vendor in real time that contains the duplicate records with the scores. An XML example follows: <Data> <DuplicateRecord> <Account.Id>SAME ID AS DRIVER </Account.Id> <DQ.MatchScore></DQ.MatchScore> </DuplicateRecord> <DuplicateRecord> <Account.Id>1-Y28</Account.Id> <DQ.MatchScore>92</DQ.MatchScore> </DuplicateRecord> <DuplicateRecord> <Account.Id>1-3-P</Account.Id> <DQ.MatchScore>88</DQ.MatchScore> </DuplicateRecord> </Data>
Return Value
A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
sdq_set_dedup_candidates Function
This function is called to provide the list of candidate records in batch mode. The number of records sent during each invocation of this function is a customer-configurable deployment-time parameter. However, this is not communicated to the vendor at run time. NOTE: To support Multiple Child Entities Deduplication, you must use the data format shown in Data Format for Deduplication of Multiple Child Entities on page 85.
242
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Syntax Parameters
int sdq_set_dedup_candidates (int session_id, SSchar* parameterList, SSchar* xmlRecordSet) session_id: The session ID obtained by initializing the session. parameterList: An XML character string that contains the list of parameters and values that are specific to this function call. An example of the XML is as follows: <Data> <Parameter> <Name>BatchDedupParam1</Name> <Value>BatchDedupValue1</Value> </Parameter> <Parameter> <Name>BatchDedupParam2</Name> <Value>BatchDedupValue2</Value> </Parameter> </Data> NOTE: The parameterList parameter is set to NULL as all required parameters are already set at the session level. xmlRecordSet: When match candidate acquisition takes place in Siebel CRM, the xmlRecordSet parameter is used as follows:
For full data matching batch jobs: An XML character string containing a list of candidate records. There is no driver record in the input set. An example of the XML is as follows: <Data> <CandidateRecord> <Account.Id>2-24-E</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Somewhere</Account.Location> </CandidateRecord> <CandidateRecord> <Account.Id>1-E-2E</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Somewhere else</Account.Location> </CandidateRecord> <CandidateRecord> <Account.Id>2-34-F</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Someplace</Account.Location> </CandidateRecord> </Data>
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
24 3
For incremental data matching batch jobs: As more candidate records are queried from the Siebel database and sent to the vendor software, the driver records must be marked so that the vendor software knows which records must return duplicate records: <Data> <DriverRecord> <Account.Id>2-24-E</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Somewhere</Account.Location> </DriverRecord> <CandidateRecord> <Account.Id>1-E-9E</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Somewhere else</Account.Location> </CandidateRecord> <DriverRecord> <Account.Id>1-E-2E</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Somewhere else</Account.Location> </DriverRecord> <CandidateRecord> <Account.Id>1-12-2H</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Somewhere else</Account.Location> </CandidateRecord> <DriverRecord> <Account.Id>2-34-F</Account.Id> <Account.Name>Siebel</Account.Name> <Account.Location>Someplace</Account.Location> </DriverRecord> </Data>
NOTE: The order of the driver records and candidate records is not significant. If a candidate has already been sent, it is not necessary to send it again even though it is a candidates associated with multiple driver records.
244
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
xmlRecordSet: When match candidate acquisition takes place in Oracle Data Quality Matching Server, the xmlRecordSet parameter is used as follows:
For full data matching batch jobs, an empty string is sent. For incremental data matching batch jobs, only driver records are sent.
An example of the XML is as follows: <Data> <DriverRecord> <DUNSNumber>123456789</DUNSNumber> <Name>Siebel</Name> <RowId>1-X40</RowId> </DriverRecord> <DriverRecord> <DUNSNumber>987654321</DUNSNumber> <Name>Oracle</Name> <RowId>1-X50</RowId> </DriverRecord> <DriverRecord> <DUNSNumber>123123123</DUNSNumber> <Name>IBM</Name> <RowId>1-X60</RowId> </DriverRecord> </Data> Return Value A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
sdq_start_dedup Function
This function is called to start the data matching process in batch mode, and essentially signals that all the records to be used for data matching have been sent to the vendors application. Syntax Parameters Return Value int sdq_start_dedup (int session_id) session_id: The session ID obtained by initializing the session. This function does not have a return value.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
24 5
sdq_get_duplicates Function
This function is called to get the master record with the list of its duplicate records along with their match scores. This is done in batch mode. The number of records received for each call to this function is set in the BATCH_MATCH_MAX_NUM_OF_RECORDS session parameter before the function is called. Syntax Parameters int sdq_get_duplicates (int session_id, SSchar* xmlResultSet) session_id: The session ID obtained by initializing the session. xmlRecordSet: An XML character string that the vendor library populates with a master record and a list of its duplicate records along with their match scores. If the number of duplicates is more than the value of the parameter BATCH_MATCH_MAX_NUM_OF_RECORDS, the results can be split across multiple function calls with each function call including the master record as well. The XML is in the following format: <Data> <ParentRecord> <DQ.MasterRecordsRowID>2-24-E</DQ.MasterRecordsRowID> <DuplicateRecord> <Account.Id>2-24-E</Account.Id> <DQ.MatchScore>92</DQ.MatchScore> </DuplicateRecord> <DuplicateRecord> <Account.Id>2-23-F</Account.Id> <DQ.MatchScore>88</DQ.MatchScore> </DuplicateRecord> </ParentRecord> </Data> Return Value A return value of 0 indicates successful execution, while a return value of 1 indicates that there are no duplicate records left. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function. NOTE: Data quality code only processes the returned XML character string while the return value is 0. Even if there are fewer records to return than the value of the BATCH_MATCH_MAX_NUM_OF_RECORDS parameter, the vendor driver sends a return value of 0 and then return a value of 1 in the next call.
246
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
sdq_datacleanse Function
This function is called to perform real-time data cleansing. The function is called for only one record at a time. Syntax Parameters int sdq_datacleanse (int session_id, SSchar* parameterList, SSchar* inputRecordSet, SSchar* outputRecordSet) parameterList: An XML character string that contains the list of parameters and values that are specific to this function call. An example of the XML is as follows: <Data> <Parameter> <Name>RealTimeDataCleanseParam1</Name> <Value>RealTimeDataCleanseValue1</Value> </Parameter> <Parameter> <Name>RealTimeDataCleanseParam2</Name> <Value>RealTimeDataCleanseValue2</Value> </Parameter> </Data> NOTE: This parameter is set to NULL as all required parameters are already set at the session level. inputRecordSet: An XML character string containing the driver record. An example of the XML is as follows: <Data> <DriverRecord> <Contact.FirstName>michael</Contact.FirstName> <Contact.LastName>mouse</Contact.LastName> </DriverRecord> </Data> outputRecordSet: A record set that is populated by the vendor in real time and which contains the cleansed record. An example of the XML is as follows: <Data> <CleansedDriverRecord> <Contact.FirstName>Michael</Contact.FirstName> <Contact.LastName>Mouse</Contact.LastName> </CleansedDriverRecord> </Data> Return Value A return value of 0 indicates successful execution. Any other value is a vendor error code. The error message details from the vendor are obtained by calling the sdq_get_error_message function.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
24 7
sdq_data_cleanse Function
The same function, is called by data quality code for both real-time and batch data cleansing. For batch data cleansing, the call is made with one record at a time.
1 2 3 4 5 6
Load the vendor library. Call sdq_init_connector. Call sdq_set_global_parameter. Call sdq_init_session. Call sdq_set_parameter (RECORD_TYPE Account/Contact/List Mgmt Prospective Contact, BATCH_DATAFLOW_NAME, BATCH_MATCH_MAX_NUM_OF_RECORDS) Query the Siebel database to get the candidate records. To get the candidate records, a query against the match key is executed. The match key itself is generated when a record is created, or key fields are updated. Universal Connector supports multiple key generation. For more information about match key generation, see Match Key Generation on page 27.
7 8 9
Call sdq_set_dedup_candidates. This function is called multiple times to send the list of all the candidate records. Call sdq_start_dedup to start the data matching process. Call sdq_getduplicate. This function is called multiple times to get all the master records and their duplicate records and until the function returns -1 indicating that there are no more records.
248
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
10 Call sdq_close_session (int * session_id) while logging out of the current session. 11 Call sdq_close_connector.
1 2 3 4 5 6
Load the vendor library. Call sdq_init_connector. Call sdq_set_global_parameter. Call sdq_init_session. Call sdq_set_parameter (RECORD_TYPE Account/Contact/List Mgmt Prospective Contact). Call sdq_get_dedup_keys. This function is called to get dedup keys of the driver record. The keys are then used to query for candidate records to send to then third party vendor in the dedup call which follows. Call sdq_dedup_realtime. Call sdq_close_session while logging out of current session. Call sdq_close_connector.
7 8 9
1 2 3 4 5 6 7 8 9
Load the vendor library. Call sdq_init_connector. Call sdq_set_global_parameter. Call sdq_init_session. Call sdq_set_parameter (RECORD_TYPE Account/Business Address/Contact/List Mgmt Prospective Contact, BATCH_DATAFLOW_NAME). Query the Siebel database to get the set of records to be cleansed. Call sdq_datacleanse. This function is called for each record in the result set of the query. It sends the driver record as XML and the output from the function has the cleansed driver record. After cleansing each record, save the record into the Siebel repository. Call sdq_close_session while logging out of current session.
10 Call sdq_close_connector.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
24 9
Administering Data Quality Viewing Parameter and Field Mapping Values for Universal Connector
1 2 3 4 5 6 7 8 9
Load the vendor library. Call sdq_init_connector. Call sdq_set_global_parameter. Call sdq_init_session. Call sdq_set_parameter (RECORD_TYPE Account/Business Address/Contact/List Mgmt Prospective Contact). Query the Siebel database to get the Driver Record. Call sdq_datacleanse. This function sends the driver record as XML and the output from the function will have the cleansed driver record. Save the record into the Siebel repository. Call sdq_close_session while logging out of current session
10 Call sdq_close_connector.
1 2 3
Call sdq_init_session. Call sdq_set_parameter (RECORD_TYPE) - Account/Contact/Prospect). Call sdq_get_dedup_keys. (. This function sends the driver record as XML and the output from this call is the list of dedup keys generated by third party vendor. Siebel then stores the keys in its dedup key table.) Call sql_close_session.
This appendix provides examples of the preconfigured vendor parameter and field mapping values for the Universal Connector using third-party software. The definitions in this appendix are as preconfigured for Oracle Data Quality Matching Server and Oracle Data Quality Address Validation Server. This appendix includes the following topic: Viewing Parameter and Field Mapping Values for Universal Connector on page 250
250
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Administering Data Quality Viewing Parameter and Field Mapping Values for Universal Connector
Related Topics Configuring Vendor Parameters on page 53 Mapping of Vendor Fields to Business Component Fields on page 53
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
25 1
Administering Data Quality Viewing Parameter and Field Mapping Values for Universal Connector
252
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Siebel Data Quality is a user based license, containing the underlying infrastructure and business services for enabling data quality. All Siebel CRM data quality users must enable data quality at the user level using Siebel Data Quality. This appendix contains the following topic: Siebel Data Quality Objects on page 253
Business Components
Account Key Contact Key Prospect Key DQ Field Mapping Info DQ Mapping Config DQ Rule DQ Rule Parameter DQ Vendor Info DQ Vendor Parameter Data Quality Setting DeDuplication - Master (Account) DeDuplication - Master (Contact) DeDuplication - Slave (Account) DeDuplication - Slave (Contact) DeDuplication Results (Account) DeDuplication Results (Contact)
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
25 3
Related Data Quality Objects in Siebel Tools Siebel Data Quality Objects
Applets
Account Duplicate List Applet Contact Duplicate List Applet DQ BC Operation Info List applet DQ Field Mapping Info List applet DQ Parameter List Applet DQ Rule Definition Form Applet DQ Rule Name List Applet DQ Vendor Info List applet DQ Vendor Parameter List Applet Data Quality Setting List Applet DeDuplication - Account Duplicate Entry Applet DeDuplication - Account Duplicate Master List Applet DeDuplication - Account Duplicate Slave List Applet DeDuplication - Contact Duplicate Master List Applet DeDuplication - Contact Duplicate Slave List Applet DeDuplication - Contact Entry Applet DeDuplication - List Mgmt Prospective Contact Entry Applet DeDuplication - Prospect Duplicate Master List Applet DeDuplication - Prospect Duplicate Slave List Applet DeDuplication Results (Account) List Applet DeDuplication Results (Contact) List Applet DeDuplication Results (Prospect) List Applet Incomplete Address Applet Prospect Duplicate List Applet
Classes
254
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Related Data Quality Objects in Siebel Tools Siebel Data Quality Objects
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
25 5
Related Data Quality Objects in Siebel Tools Siebel Data Quality Objects
256
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
This appendix discusses where to find information relevant to your use of the data quality products. It includes the following topics: Important Data Quality Resources on page 257 Data Quality Seed Data on page 259
Siebel Installation Guide for the operating system you are using for details on how to install data quality products Siebel System Administration Guide for details on how to administer, maintain, and configure your Siebel Servers Configuring Siebel Business Applications for information about configuring Siebel Business Applications using Siebel Tools Siebel Developers Reference for detailed descriptions of business components, user properties, and so on
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
25 7
Finding and Using Data Quality Information Important Data Quality Resources
Siebel Deployment Planning Guide to familiarize yourself with the basics of the underlying Siebel application architecture. Siebel System Monitoring and Diagnostics Guide. Going Live with Siebel Business Applications for information about how to migrate customizations from the development environment to the production environment. Siebel Security Guide for information about built-in seed data in the enterprise database, such as employee, position, and organization records. Siebel Performance Tuning Guide for information about tuning and monitoring specific areas of the Siebel application architecture and infrastructure, such as the object manager infrastructure.
Siebel Data Model Reference (Article ID 546778.1) for information about how data used by the Siebel application is stored in a standard third-party relational DBMS such as DB2, Microsoft SQL Server, or Oracle and some of the data integrity constraints validated by Siebel Business Applications. Siebel eScript Language Reference for information about writing scripts to extend data quality functionality. Siebel Applications Administration Guide for general information about administering Siebel Business Applications. Siebel Database Upgrade Guide or Siebel Database Upgrade Guide for DB2 UDB for z/OS for information about upgrading your installation. Siebel System Requirements and Supported Platforms on Oracle Technology Network for a definitive list of system requirements and supported operating systems for a release, including the following:
Information on supported third-party products A description of supported upgrade paths Lists of product and feature limitations; either unavailable in the release or in certain environments
Oracle Customer Hub (UCM) Master Data Management Reference provides reference information about Oracle Master Data Applications.
Third-Party Documentation
The third-party documentation, included in Siebel Business Applications Third-Party Bookshelf in the product media pack on Oracle Software Delivery Cloud, must be used as additional reference when using data quality products
258
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Finding and Using Data Quality Information Data Quality Seed Data
476548.1 (Article ID) on My Oracle Support. This document was previously published as Siebel FAQ 1593. 476974.1 (Article ID) on My Oracle Support. This document was previously published as Siebel FAQ 1843. 476926.1 (Article ID) on My Oracle Support. This document was previously published as Siebel Alert 611.
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
25 9
Finding and Using Data Quality Information Data Quality Seed Data
260
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Index
A
activation action sets 184 architecture, of data quality in relation to Siebel application 22
external vendors, configuring for 54 field mappings, about 53 connector, Universal Connector 21
B
batch data cleansing, about 207 batch data matching, about 207 batch mode data cleansing using batch mode 212 data matching using Universal Connector 213 data quality component jobs, customizing 216 described 206 full data matching jobs 214 generating keys, using batch mode 215 incremental data matching jobs 214 business components data cleansing, process of configuring for 51 data cleansing, troubleshooting 231 data matching, process of configuring for 51 business service DataCleansing 22 DeDuplication 22 Business Service methods data cleansing scenario 225 Get Siebel Fields 230 Parse 230 Value Match 225 business services user properties for Account 59 for Contact 60 for Data Cleansing 63 for DeDuplication 63 for DQ Sync Services 63 for ISS System Services 66 for List Mgmt Prospective Contact 62
D
data cleansing about 16 Account business component field mappings 112, 158 batch data cleansing, about 207 batch job parameters 210 batch mode 206 batch mode, about running data cleansing 212 business components, process of configuring 51 Business Service method scenario 225 Contact business component field mappings 113, 158 data quality component jobs for batch mode, customizing 216 defined 16 disabling for records 43 disabling without restarting 39 field mappings, about 53 Get Siebel Fields method 230 levels of enabling and disabling 33 List Mgmt Prospective Contact business component field mappings 158 optimizing performance 67 Parse method 230 Parse method invocation 231 real-time mode 206 real-time mode, about running in 206 troubleshooting 231 Universal Connector 25 data matching about 16 Account business component field mappings 112, 158 batch data matching, about 207 batch job parameters 210 batch mode 206, 213 business components, process of configuring 51 configuring a new field 115 configuring deduplication against multiple addresses 139
C
candidate records 29 configuration options data cleansing, process of configuring for 51 data matching, process of configuring for 51 Universal Connector, associating connector to a business component 52 connector mappings adding a field mapping 54
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
26 1
Index E
configuring multiple language support 141 configuring multiple mode support 145 Contact business component field mappings 113, 158 data quality component jobs for batch mode, customizing 216 data quality settings, applying 208 defined 16 disabling without restarting 39 duplicate records, filtering 220 duplicate records, merging 221 duplicate records, process of filtering and merging 220 full data matching jobs 214 generating keys using batch jobs 215 Get Siebel Fields method invocation 230 incremental data matching jobs 214 levels of enabling and disabling 33 List Mgmt Prospective Contact business component field mappings 113 Matching Server 26 optimizing performance 68 real-time mode 206 real-time mode, about running in 206 real-time mode, enabling using command line 41 sequenced merges, about 219 sequenced merges, field characteristics 220 set up process for Oracle Data Quality Matching Server 90 troubleshooting 231 Universal Connector 26 user preference options, setting 42 Value Match method called from example 227 Value Match method input property sets 225 Value Match method output property sets 227 Value Match method scenario 224 Value Match method, about 224, 225 data quality architecture, in relation to Siebel application 22 configuration options 48 configuring for Oracle Data Quality Matching Server 110 data cleansing using batch jobs 212 data matching using batch jobs 213 modes of operation 206 products 17 resources 257 seed data 259 troubleshooting 231
data quality component jobs, customizing 216 Data Quality Manager about using 207 customized component jobs, creating 216 data quality rules batch jobs 208 creating 208 rule parameters 208 data quality settings applying 208 Enable DataCleansing setting 37 Enable DeDuplication setting 37 Force User Dedupe Account 37 Force User Dedupe Contact 37 Force User Dedupe List Mgmt 38 Fuzzy Query - Max Returned 38 Fuzzy Query Enabled 38 Match Threshold 38 specifying 37 user preference options, setting 42 data quality software enabling at the Enterprise level 35 enabling at the object manager level 39 data quality user properties 58 for Account 59 for Contact 60 for Data Cleansing 63 for DeDuplication 63 for DQ Sync Services 63 for ISS System Services 66 for List Mgmt Prospective Contact 62 data source definition, configuring 120 data synchronization, configuring between Siebel and Oracle Data Quality Matching Server 120 DataCleansing business service 22 Dedup Query and Dedup Token Expression 29 Dedup Token Expression key 28 DeDuplication business service 22 deduplication, configuring data matching against multiple addresses 139 duplicate records 31 dynamic link libraries (DLLs) libraries supported 49 vendor 233
E
Enable Data Cleansing field 42 Enable DataCleansing setting 37 Enable DeDuplication setting 37 example configuration files
262
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Index F
ssadq_cfg.xml 146 ssadq_cfgasm.xml 160 example SDF file, SiebelDQ.sdf 197 example SQL scripts 184 IDS_IDT_ACCOUNT_STG.SQL 185 IDS_IDT_CONTACT_STG.SQL 186 IDS_IDT_CURRENT_BATCH.SQL 188 IDS_IDT_CURRENT_BATCH_ACCOUNT.SQL 189 IDS_IDT_CURRENT_BATCH_CONTACT.SQL 189 IDS_IDT_CURRENT_BATCH_PROSPECT.SQL 190 IDS_IDT_LOAD_ANY_ENTITY.CMD 190 IDS_IDT_LOAD_ANY_ENTITY.sh 192 IDS_IDT_PROSPECT_STG.SQL 187 external callers, calling data quality from 224
I
incremental data load 118 installation Oracle Data Quality Matching Server on UNIX 94 Oracle Data Quality Matching Server, on Windows 92 third-party software, about installing for using with Universal Connector 49 Universal Connector 49 ISS Data Source, configuring object managers for 121
K
keys, Dedup Token Expression 28
L
libraries, dynamic link libraries (DLLs), supported 49 loading Siebel data incrementally into Informatica Identity Resolution tables 118
F
field mappings Account business component 112, 158 Contact business component 113, 158 mapping Universal Connector data cleansing fields to Siebel business component fields 55 mapping Universal Connector data matching fields to Siebel business component fields 54 preconfigured, for Oracle Data Quality Address Validation Server 157 preconfigured, for Oracle Data Quality Matching Server 112 filtering duplicate records 220 Force User Dedupe Account setting 37 Force User Dedupe Contact setting 37 Force User Dedupe List Mgmt setting 38 fuzzy query 32 configuring mandatory fields 58 enabling and disabling 43 example of enabling for use with Accounts 223 Fuzzy Query - Max Returned data quality setting 38 Fuzzy Query Enabled data quality setting 38 identifying mandatory fields 45 using 222
M
match key, defined 27 match scores, calculating using Universal Connector 30 Match Threshold data quality setting 38 Merge button 219 Merge Records option 219 merge, about sequenced merges 219 merge, sequenced merges field characteristics 220 merging duplicate records 219, 221 merging duplicate records, process 220 modes, operation modes described 206 multiple addresses, configuring deduplication against 139 multiple language support, configuring for data matching 141 multiple mode support, configuring for data matching 145
O
object managers, configuring for ISS Data Source 121 optimizing data cleansing performance 67 optimizing data matching performance 68 Oracle Data Quality Address Validation Server about 19 configuration parameters, modifying 156
G
generating keys using batch mode 215 Get Siebel Fields method 230
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
26 3
Index P
configuring 155 installing 149 set up process for data cleansing 149 Oracle Data Quality Matching Server about 18 activating action sets in your Siebel application 184 configuration parameters, modifying 104 configuring 110 configuring a new field for data matching 115 configuring multiple language support 141 configuring multiple mode support 145 configuring on UNIX 102 configuring on Windows 101 data synchronization between Siebel and 120 database user and table creation 97 identifying candidate records 29 installing 90 installing on UNIX 94 installing on Windows 92 loading Siebel data 106 set up process for data matching 90 workflow deployment and activation 105 Oracle Enterprise Data Quality configuring and administering 69 configuring for matching and address validation 70 configuring for matching and address validation, prerequisites 70 configuring Siebel server 72 configuring UNIX server 71 configuring windows server 70
S
searching duplicate records, process 220 seed data, for data quality 259 sequenced merges about 219 field characteristics 220 Siebel Business Applications, configuring for Oracle Data Quality Address Validation Server 155 Siebel Data Quality, license 18, 253 Siebel Server disabling data cleansing without restarting server 39 disabling data matching without restarting server 39 synchronization, configuring between Siebel and Oracle Data Quality Matching Server 120
T
third-party software Universal Connector, installing for use with 49 Universal Connector, using with 21 troubleshooting, data matching and data cleansing 231
U
Universal Connector about 21 architecture 22 business component, associating the connector to 52 data cleansing 25 data matching 26 dynamic link libraries (DLLs), supported 49 fields, mapping to Siebel business component fields. 55 identifying candidate records 29 installing, about 49 match key generation 28 match scores 30 new connectors, process of configuring 50 new connectors, registering 51 preconfigured field mappings for Oracle Data Quality Address Validation Server 157 preconfigured field mappings for Oracle Data Quality Matching Server 112 preconfigured vendor parameters for Oracle Data Quality Address Validation Server 157 preconfigured vendor parameters for Oracle
P
Parse method 230 potential duplicates 31
R
real-time mode data matching and data cleansing, about running 206 described 206 records data cleansing, disabling for records 43 duplicate records, filtering 220 duplicate records, merging 219, 221 duplicate records, process of filtering and merging 220 sequenced merges, about 219 sequenced merges, field characteristics 220
264
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
Index V
Data Quality Matching Server 112 third-party software, about installing 49 Universal Connector API batch mode data cleansing functions 248 connector initialization and termination functions 233 data cleansing and data matching algorithms 248 error message functions 238 parameter setting functions 235 real-time data cleansing functions 247 real-time data matching functions 239 sdq_close_session function 235 sdq_data_cleanse function 248 sdq_datacleanse function 247 sdq_dedup_realtime function 239 sdq_dedup_realtime_nomemory function 241 sdq_get_error_message function 238 sdq_init_connector function 234 sdq_init_session function 235 sdq_set_global_parameter function 235 sdq_set_parameter function 236 sdq_shutdown_connector function 234 session initialization and termination functions 234 UNIX Oracle Data Quality Matching Server configuration 102 Oracle Data Quality Matching Server installation 94 user preference options, setting 42 user properties
V
Value Match method called from example 227 data matching scenario 224 input property sets 225 output property sets 227 vendor parameters configuring 53 preconfigured, for Oracle Data Quality Address Validation Server 157 preconfigured, for Oracle Data Quality Matching Server 112 vendor properties business components 53 troubleshooting 232 vendors connector mappings, adding a field mapping 54 connector mappings, configuring for external vendors 54 data quality field mappings, about 53 rules for dynamic link libraries 233
W
windows configuring for real-time data matching 56 Oracle Data Quality Matching Server configuration 101 Oracle Data Quality Matching Server installation 92
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)
26 5
Index W
266
Data Quality Guide for Oracle Customer Hub Version 8.2 (8.1.1.7 Quick Fix 07AN)