Saturday, July 21, 2012

Migrating to an InfoSphere Warehouse instance that is installed on a different computer

You can migrate from Data Warehouse Edition V9.1.x to InfoSphere Warehouse when these products are installed on two separate computers. Assume that you want to migrate a Data Warehouse Edition V9.1.x instance that is installed on computer A to an InfoSphere Warehouse instance that is installed on computer B.
Before you begin
  1. Back up the metadata and scheduler databases on computer A.
  2. Copy all of the data warehouse projects that you want to migrate to computer B.
  3. Copy all of the deployed data warehousing applications from computer A to the same location on computer B.
    To deploy data warehousing applications, three directories are used:
    • Application home directory
    • Log directory
    • Working directory
    All three directories must be created at exactly the same locations on computer B. For example, if an application is deployed on computer A at C:\application_dir\applicaton_1, that directory is the application home directory. Assume that C:\log\ is the log directory and C:\temp\working\ is the working directory. You must create these three directories on computer B and copy all the files in them from computer A to computer B. Otherwise, the migrated application will not work. Note that each data warehouse application can have its own set of three directories, so if you have 100 applications, there can be up to 300 different directories that you must copy to computer B. If some directories are not copied, the migration log might contain an error message indicating that certain files were missing during migration.
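The directory check described above can be sketched in Python. This is a minimal illustration, not part of the product: it assumes that the directory trees of both computers are reachable from the machine that runs the script (for example, through a network share or a staging copy), and the function names and the "home", "log", and "working" labels are hypothetical.

```python
from pathlib import Path


def missing_on_target(source_dir: Path, target_dir: Path) -> list[str]:
    """Return relative paths of files under source_dir that are absent under target_dir."""
    missing = []
    for path in sorted(source_dir.rglob("*")):
        if path.is_file():
            relative = path.relative_to(source_dir)
            if not (target_dir / relative).is_file():
                missing.append(relative.as_posix())
    return missing


def check_application(dirs_on_a: dict[str, Path], dirs_on_b: dict[str, Path]) -> dict[str, list[str]]:
    """Compare the home, log, and working directories of one deployed application."""
    report = {}
    # The three labels are hypothetical names for the three directories per application.
    for role in ("home", "log", "working"):
        report[role] = missing_on_target(dirs_on_a[role], dirs_on_b[role])
    return report
```

Running check_application once per deployed application before the migration gives a list of files that still need to be copied. An empty list for all three directories means that nothing from computer A is missing on computer B.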
Procedure
To complete the migration from computer A to computer B:
  1. Restore the metadata and scheduler databases on computer B.
  2. Migrate the WebSphere® application profile. For detailed help on migrating the WebSphere application profile that is located on a separate computer, see the WebSphere Information Center.
  3. Run the InfoSphere Warehouse Configuration Tool on computer B.
  4. Specify the migration settings in the migration.properties file.
  5. Run the migration script on computer B.

Post-migration tasks

After migrating the data warehouse projects with the InfoSphere Warehouse Migration wizard, you must perform some post-migration tasks to avoid errors when you use the migrated data warehouse projects in the Design Studio. You must also specify the installation location of InfoSphere Warehouse in the config.properties file before you start using the Administration Console.
Before you begin
  • Migrate the data warehouse projects from Data Warehouse Edition V9.1.x to InfoSphere Warehouse.
Procedure
Perform the following post-migration tasks:
  1. Switch to a new workspace for working with the migrated projects in the Design Studio, Version 9.5.1.
  2. Import the migrated projects into this new workspace:
    1. Click File > Import. The Import window opens.
    2. Click General > Existing Projects into Workspace and then click Next.
    3. Browse to the directory that contains the migrated projects.
    4. Select the projects that you want to import in the workspace and click Finish. The imported projects are displayed in the Data Project Explorer.
    Recommended: Import the migrated projects into a new workspace before you start using them in InfoSphere Warehouse. If you import the migrated projects into an old workspace that was used with Data Warehouse Edition V9.1.x, you might not see some of the views in the BI perspective or the new menu shortcuts (for example, Workload Management and Text Analysis). To continue using the old workspace, perform one of the following two steps:
    • Reset the BI perspective by selecting Window > Reset Perspective.
    • Select Window > Customize Perspective and select the shortcuts that you would like to see in the various menus of the BI perspective.
  3. Re-create the database connections in the Database Explorer view. The migration wizard migrates only the data warehouse projects that are contained in the source directory. It does not migrate the workspace preferences or the database connections to the destination directory.
  4. If you migrated mining flows that are contained in your data warehouse projects, then you must restore the links to the database connections that are used by the mining flows. Perform one of the following two steps:
    • In the Data Project Explorer, right-click the Mining Flows folder in your data warehouse project. Select Set Online Database and then select the mining flows for which you want to restore the database links. Design Studio sets the online database to the same value as the SQL execution database for each of the selected mining flows.
    • Alternatively, right-click the name of a mining flow and select the Set Database option. Then, select the name of the database connection and click OK. This method is less convenient, particularly when you have to restore links to several database connections.
  5. Before you start using the InfoSphere Warehouse Administration Console, perform these steps:
    1. Open the config.properties file that is located in the InfoSphere Warehouse installation directory (product_installation_directory\DWEAdmin\lib\custom).
    2. Add the keyword dwe.installLocation and specify the complete path to the product installation directory.
      For example, dwe.installLocation = C\:Program Files\IBM\dwe95
      Remember: The keyword dwe.installLocation is case sensitive.
    3. Restart the WebSphere® Application Server.
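Because the dwe.installLocation keyword is case sensitive, a quick check of config.properties before restarting the server can catch a mistyped key. The following Python sketch is illustrative only: it assumes that the file consists of simple key and value lines separated by "=", ":", or whitespace, and the function name and the returned labels are hypothetical.

```python
import re

KEY = "dwe.installLocation"


def check_install_location_key(properties_text: str) -> str:
    """Return 'ok', 'wrong_case', or 'missing' for the dwe.installLocation key."""
    result = "missing"
    for raw_line in properties_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines and comment lines.
        if not line or line.startswith(("#", "!")):
            continue
        # Take everything before the first separator as the key name.
        name = re.split(r"[=:\s]", line, maxsplit=1)[0]
        if name == KEY:
            return "ok"
        if name.lower() == KEY.lower():
            result = "wrong_case"
    return result
```

A result of wrong_case means that the key is present but spelled with different capitalization, for example DWE.InstallLocation, and the key must be corrected before the restart.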

Migrating data warehouse projects and OLAP metadata

You can migrate data warehouse projects and OLAP metadata from within the Design Studio by using the InfoSphere Warehouse Migration wizard.
Before you begin
Ensure that:
  • You installed the Design Studio, migration tool plug-ins, and the SQL Warehousing Tool plug-ins of InfoSphere Warehouse.
  • The common repository database is created on the DB2® V9.5 database server.
  • The projects to be migrated are from DB2 Data Warehouse Edition V9.1.x.
About this task
You can migrate the following components by using the InfoSphere Warehouse Migration wizard:
  • Data warehouse projects (SQW and mining flows)
  • OLAP metadata in databases
You can migrate one component at a time or both of the components together. In addition, you can run the wizard multiple times on different data warehouse projects. For example, if you have 100 data warehouse projects to migrate, you can choose to migrate 50 projects in the first run, and then run the migration wizard again to migrate the remaining 50 projects. You don't need to migrate all of the projects at the same time.
Remember: Do not run the Migration wizard more than once to migrate the OLAP metadata in databases, if the metadata was migrated successfully in the first run. You will receive an error when you run the Migration wizard again to migrate the metadata. However, if the first run fails, then the Migration wizard rolls back the migration process and you can run the wizard again.
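When many projects are migrated over several wizard runs, it can help to plan the batches in advance. The following Python sketch only splits a list of project names into groups of a chosen size. The function name and the project names are hypothetical, and the wizard itself is still run manually for each batch.

```python
def plan_batches(projects: list[str], batch_size: int) -> list[list[str]]:
    """Split the project list into consecutive batches of at most batch_size items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [projects[start:start + batch_size]
            for start in range(0, len(projects), batch_size)]


# Hypothetical project names, used only for illustration.
projects = [f"project_{number:03d}" for number in range(1, 101)]
runs = plan_batches(projects, 50)
```

With 100 projects and a batch size of 50, the plan contains two runs of 50 projects each, which matches the example above.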
Procedure
To migrate data warehouse projects and OLAP metadata:
  1. Start the InfoSphere Warehouse Design Studio by using a new workspace. You will use this workspace for migration purposes only.
  2. In the Database Explorer view, create a connection to:
    • The database from which you will migrate the OLAP metadata.
    • The databases that are used in the data warehouse projects.
  3. From the main menu, click Data Warehousing > InfoSphere Warehouse Migration. The Migration wizard opens.
  4. In the Component Selection page, perform these steps:
    1. Select the components to migrate.
    2. Specify the location to save the migration log. By default, a MigrationLog.txt file is created in your current workspace directory.
    3. Click Next.
  5. To migrate data warehouse projects:
    1. In the Project Selection page, click Add. The Project Selection window opens.
    2. In the Source directory field, click the ellipsis (...) button and browse to the directory that contains the data warehouse projects to be migrated. This can be any directory where the projects are stored. It need not be a genuine Eclipse workspace directory. Ensure that you have read permission on this directory. The list of projects that are stored in the selected directory is displayed in the Select Projects area.
      Note: The directories must be real local file system directories. If the projects are stored in a version control system, such as CVS or ClearCase®, you must first check out the files and copy them to the local file system. Then, after running the Migration wizard, check in the migrated files to the version control system, if needed.
    3. In the Select projects area, select one or more data warehouse projects to migrate. By default, the wizard selects all the projects in the workspace for migration.
    4. In the Destination directory field, click the ellipsis (...) button and browse to a location on your local computer where you want to store the migrated project. By default, the destination directory is the same as the source directory but with a _95 suffix added to it. Ensure that you have write permission on this directory. For example, if the name of the source directory is v912DWHProjects, then the default name of the destination directory is v912DWHProjects_95.
      Note: If the name of the destination directory is the same as the source directory, then the old data warehouse projects are overwritten, and you can immediately use the migrated projects without importing the migrated projects into the workspace.
    5. Click OK. The source and destination directories are displayed on the Project Selection page.
  6. To migrate OLAP metadata:
    1. Select the databases that are to be used for migration.
    2. Optional: If you did not create a connection for the databases, click New Connection and enter the database details to create a connection. Click Next.
    3. Specify the connection information for the common repository database that you created during the InfoSphere Warehouse installation. Test the repository connection.
  7. Click Finish. Depending on your selection, the data warehouse projects, the OLAP metadata, or both are migrated. The data warehouse projects are migrated to the destination directory, and the OLAP metadata is migrated to the common repository database. A Migration Log window opens and displays the migration summary.
After a successful migration, the old data warehouse projects in V9.1.x are not modified, unless you select a destination directory that is the same as the source directory. The existing OLAP metadata is dropped from the database.
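The default destination naming from step 5 and the overwrite condition can be sketched in Python. This is an illustration of the naming rule only, not the wizard's own code: the function names are hypothetical, and the sketch assumes Windows-style paths that are compared without regard to case.

```python
from pathlib import PureWindowsPath


def default_destination(source: str) -> str:
    """Return the source directory path with a _95 suffix added to its name."""
    src = PureWindowsPath(source)
    return str(src.with_name(src.name + "_95"))


def overwrites_source(source: str, destination: str) -> bool:
    """Return True when the destination is the same directory as the source."""
    # PureWindowsPath comparison ignores case, as Windows file systems usually do.
    return PureWindowsPath(source) == PureWindowsPath(destination)
```

For a source directory named v912DWHProjects, default_destination returns the same path ending in v912DWHProjects_95. If overwrites_source returns True for the chosen destination, the old projects will be replaced by the migrated ones.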

Informatica in DWH

Accelerate Deployment, Lower Costs and Risk of Enterprise Data Warehousing

The Informatica solution for enterprise data warehousing is proven to help IT departments implement data marts and departmental data warehouses and readily scale them up to enterprise data warehousing environments. This solution serves as the foundation for all data warehousing and enterprise data warehousing projects. It accelerates their deployment, minimizing costs and risks, by ensuring that enterprise data warehouses are populated and maintained with trustworthy, actionable, and authoritative data.

A Proven Solution for Enterprise Data Warehousing

The Informatica solution for enterprise data warehousing empowers your IT team to easily adapt to changing business requirements. With the Informatica solution for enterprise data warehousing, your IT organization can provide your business with the data it needs for:
  • Deeper competitive insights
  • Faster decision-making abilities
  • More comprehensive visibility across business units
  • Greater transparency for regulatory compliance
The solution consists of:
  • The Informatica Platform for accessing, discovering, cleansing, and integrating all enterprise data from a single, unified platform that's secure, reliable, and highly scalable
  • Informatica Velocity, a proven implementation methodology that aligns IT teams with the business to implement data warehousing and enterprise data warehousing projects consistently, rapidly, and cost-effectively
  • Informatica Professional Services to help your IT organization quickly configure data warehousing and enterprise data warehousing projects for maximum flexibility

Informatica Solution for Enterprise Data Warehousing Key Features

  • Universal access to all enterprise data—regardless of location, format, volume, latency, or type—including transaction and interaction data
  • Self-service to help architects, developers, and analysts collaborate effectively on data warehousing and enterprise data warehousing projects
  • Integrated data quality to ensure that accurate and trustworthy data populates data warehousing and enterprise data warehousing projects
  • Master data management to handle dimensions and hierarchies of data so that your enterprise data warehouse delivers an authoritative view to business intelligence and analytics
  • Metadata management to ensure that data used in your data warehousing and enterprise data warehousing projects is fully auditable and traceable from end to end
  • Granular user management and security to enforce segregation of duties for compliance purposes and to manage enterprise data warehousing work effectively among teams
  • Enterprise grid and pushdown optimization to increase data warehousing and enterprise data warehousing speed, efficiency, and reliability—from departmental reporting to Big Data analytics
  • Real-time data integration to deliver data from your data warehouse in batch, changed data, and real-time formats
  • Data services to augment data warehousing and enterprise data warehousing and deliver trusted data either physically or virtually to any application, at any latency, using any protocol
  • Test data management to dramatically shrink the footprint of nonproduction systems
  • Hybrid platform to accelerate deployments of business intelligence, data warehousing, and enterprise data warehousing environments by populating them with on-premise or cloud-based data
  • Data Archive for Data Warehouse to provide visibility into how data is used in the data warehouse, manage explosive data growth, optimize data warehouse infrastructure and performance, and reduce overall costs