Basic Features: PowerCenter, PowerMart
Informatica is one of the most powerful ETL tools, with a
simple visual interface. You do most of the work by dragging and
dropping with your mouse in the Designer. This graphical approach also makes it
very easy to understand what is going on (it is "self-documenting" in
a sense).
It can communicate with all major databases and can
move and transform data between them. It can move huge volumes of data very
efficiently. It can also throttle transactions (do big updates in small chunks
to avoid holding locks for a long time and filling the transaction log).
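The chunking idea can be illustrated outside Informatica. The following is a minimal sketch in Python using the standard sqlite3 module, not Informatica's own mechanism; the table name "orders", the "status" column, and the function names are assumptions made up for the illustration. It applies one large update as a series of small batches and commits after each batch, so that no single transaction holds locks or log space for long.

```python
import sqlite3


def update_in_chunks(conn, chunk_size=1000):
    """Apply one large UPDATE as a series of small, separately committed batches."""
    cur = conn.cursor()
    total = 0
    while True:
        # Touch at most chunk_size not-yet-processed rows per transaction.
        cur.execute(
            "UPDATE orders SET status = 'archived' "
            "WHERE id IN (SELECT id FROM orders WHERE status = 'open' LIMIT ?)",
            (chunk_size,),
        )
        changed = cur.rowcount
        conn.commit()  # end the short transaction before starting the next batch
        if changed == 0:
            break
        total += changed
    return total


def demo():
    """Build a hypothetical 2500-row table in memory and archive it in batches."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT)")
    conn.executemany("INSERT INTO orders (status) VALUES (?)", [("open",)] * 2500)
    conn.commit()
    updated = update_in_chunks(conn, chunk_size=1000)
    remaining = conn.execute(
        "SELECT COUNT(*) FROM orders WHERE status = 'open'"
    ).fetchone()[0]
    return updated, remaining
```

Calling demo() runs the update in batches of 1000, 1000 and 500 rows, each in its own transaction.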
It can efficiently join tables that live in different
databases on different servers. The tasks are performed by the Informatica Server
(which runs on Unix or MS Windows). You get a client application called "Server
Manager" to work with the server.
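To make the cross-database join concrete, here is a conceptual sketch in Python. It is not how the Informatica Server is implemented; it only shows the general technique of reading rows from two independent connections and joining them in the tool rather than in either database. The two in-memory SQLite databases, the "customers" and "orders" tables, and all function names are assumptions for the illustration.

```python
import sqlite3


def make_db(ddl, insert_sql, rows):
    """Create an independent in-memory database holding a single table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(ddl)
    conn.executemany(insert_sql, rows)
    conn.commit()
    return conn


def join_across_databases(customers_db, orders_db):
    """Inner-join orders to customers although they live in separate databases."""
    # Build an in-memory lookup from one source, keyed on the join column.
    names = {}
    for customer_id, name in customers_db.execute("SELECT id, name FROM customers"):
        names[customer_id] = name

    # Stream the other source and probe the lookup for each row.
    joined = []
    for order_id, customer_id, amount in orders_db.execute(
        "SELECT id, customer_id, amount FROM orders ORDER BY id"
    ):
        if customer_id in names:  # inner join: unmatched orders are dropped
            joined.append((order_id, names[customer_id], amount))
    return joined


def demo_join():
    customers_db = make_db(
        "CREATE TABLE customers (id INTEGER, name TEXT)",
        "INSERT INTO customers VALUES (?, ?)",
        [(1, "Acme"), (2, "Globex")],
    )
    orders_db = make_db(
        "CREATE TABLE orders (id INTEGER, customer_id INTEGER, amount REAL)",
        "INSERT INTO orders VALUES (?, ?, ?)",
        [(10, 1, 99.5), (11, 2, 20.0), (12, 3, 5.0)],
    )
    return join_across_databases(customers_db, orders_db)
```

Neither database needs to know about the other; the join happens entirely in the process that holds both connections.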
You design your processes in a client application called
"Designer". This is where you specify the source databases and
tables, the targets, and how the data is moved and transformed.
Informatica uses its own database, called the "Metadata
Repository Database", or simply the repository. The repository stores the metadata
(rules) needed for data extraction, transformation, loading, and management.
You get a client application called "Repository Manager" to work with the
repository.
Products:
Comprised of the flagship Informatica PowerCenter® and
Informatica PowerMart® software products, as well as expanded connectivity
solutions that extend even across firewalls, this data integration platform
helps companies integrate and analyze data from virtually any corporate system,
including ERP, CRM, supply chain, procurement, eBusiness, mainframe and legacy
systems. The Informatica platform provides real-time performance, scalability
and extensibility to handle the analytic requirements of even the largest
enterprises.
Informatica offers several products:
- PowerCenter
- PowerMart
- PowerConnect
- PowerChannel
- PowerPlug
PowerCenter & PowerMart :
The two most important products are PowerCenter and
PowerMart. PowerMart is a subset of PowerCenter. PowerCenter has better load
balancing.
PowerCenter : is the world-standard enterprise data
integration platform for the creation and real-time management of data
warehouse infrastructures and analytic applications.
- Distributed metadata (Repository)
- Global & Local Repositories (more than one Informatica server can be registered, and information can be shared through the global repository). Multiple repositories, many servers.
- Can connect to varied sources like PeopleSoft, SAP etc.
- Costs more
- Has bridges that can transport metadata from other tools (like Erwin)
PowerMart : is a departmental data integration platform that
can be expanded to the enterprise level with Informatica PowerCenter 5 as a
company's needs grow.
- Only a local repository (can't register other repositories). Single repository, one server.
- Cannot connect to varied sources like PeopleSoft, SAP etc.
- Costs less
Mappings : A mapping defines
the flow of data from the source to the target database. A mapping contains one
or more transformations. After defining the sources and targets, we
define the mappings. The mapping definitions are stored in the
repository.
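As a rough analogy for readers who think in code, a mapping can be pictured as a source feeding an ordered chain of transformations whose output lands in a target. The sketch below is plain Python, not Informatica syntax; the two transformations, the row fields "name" and "active", and the function run_mapping are hypothetical names chosen for the illustration.

```python
def filter_active(rows):
    """Transformation 1: keep only rows flagged as active."""
    return (row for row in rows if row["active"])


def uppercase_name(rows):
    """Transformation 2: return copies of the rows with the name upper-cased."""
    return ({**row, "name": row["name"].upper()} for row in rows)


def run_mapping(source_rows, transformations):
    """Pass the source rows through each transformation in order."""
    rows = iter(source_rows)
    for transformation in transformations:
        rows = transformation(rows)
    return list(rows)  # the rows that would be written to the target


source = [
    {"name": "ann", "active": True},
    {"name": "bob", "active": False},
]
target = run_mapping(source, [filter_active, uppercase_name])
```

The list of transformations plays the role of the mapping definition: changing the list changes what reaches the target, without touching the source or the target themselves.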
Informatica Repository Server : A server used for managing
the information within the metadata repository. All client tools
communicate with the Repository Server, which manages the information in the
repository.
Whenever we work with a client tool, the first task is to
log on to the repository, which requires a valid user account. The client tools
and the Repository Server communicate over TCP/IP.
Informatica Server : is responsible for loading
data. Parts of the Informatica Server :
1. Load Manager
2. Data Transformation Manager
3. Reader
4. Writer
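The text above only names the four parts, so the following Python sketch rests on an assumption about their roles: that the Reader extracts rows from the source, the Data Transformation Manager applies the transformation logic, the Writer loads rows into the target, and the Load Manager starts and coordinates the run. All function names are stand-ins invented for the illustration and do not correspond to any Informatica API.

```python
def reader(source):
    """Reader stand-in: yield rows from the source one at a time."""
    for row in source:
        yield row


def transform(rows, fn):
    """Data Transformation Manager stand-in: apply fn to every row."""
    for row in rows:
        yield fn(row)


def writer(rows, target):
    """Writer stand-in: append each row to the target and count the rows."""
    count = 0
    for row in rows:
        target.append(row)
        count += 1
    return count


def load_manager(source, target, fn):
    """Load Manager stand-in: wire the stages together and run them."""
    return writer(transform(reader(source), fn), target)


target_rows = []
rows_loaded = load_manager([1, 2, 3], target_rows, lambda value: value * 10)
```

Because each stage is a generator, rows flow through one at a time rather than being held in memory as a whole, which is the usual reason for splitting a load into separate read, transform and write stages.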
Informatica Repository : There are 2 types of repositories,
global and local (with reference to PowerCenter). PowerMart has
only one local repository. Information about all local repositories is
held in the global repository. Shared information can be put in the global
repository.
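One way to picture the relationship is as a small data structure: local repositories register with a global repository, and anything not found locally is looked up among the global repository's shared objects. This is a simplified model under that assumption, not Informatica's actual storage layout; the class names, the register and lookup methods, and the sample object names are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class GlobalRepository:
    name: str
    shared: dict = field(default_factory=dict)      # objects visible to all locals
    registered: dict = field(default_factory=dict)  # local repositories by name

    def register(self, local):
        """Record a local repository and link it back to this global one."""
        self.registered[local.name] = local
        local.global_repo = self


@dataclass
class LocalRepository:
    name: str
    objects: dict = field(default_factory=dict)
    global_repo: Optional["GlobalRepository"] = None

    def lookup(self, key):
        """Prefer a local object, then fall back to the shared global objects."""
        if key in self.objects:
            return self.objects[key]
        if self.global_repo is not None:
            return self.global_repo.shared.get(key)
        return None


corporate = GlobalRepository("corporate", shared={"dim_date": "shared dimension"})
sales = LocalRepository("sales", objects={"m_load_sales": "sales mapping"})
corporate.register(sales)
```

In this model, sales.lookup("dim_date") resolves through the global repository, while sales.lookup("m_load_sales") is answered locally.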
Folders : Folders act like containers and are used to
organize the information within the repository. There can be one or more
folders within a repository. Each folder is independent of the others. All
work is done within a folder. A folder contains a pre-defined
set of subfolders.
- Sources (source database object definitions)
- Targets (target database object definitions)
- Dimensions (dimension definitions)
- Cubes (cube definitions)
- Transformations (reusable transformation definitions)
- Mapplets (mapplet definitions. A mapplet is a reusable part of a mapping that contains one or more reusable transformations)
- Mappings (mapping definitions)
- Business Components (definitions classified according to business usage)
- Sessions (session definitions. A session is a task based on a mapping)