Pricing Information

Products Security | Scalability and Performance | Global Language Support
Overview
Related Events
Related Case Studies
Related Resources
Related News
"IDOL offers a broad set of capabilities for content classification. IDOL supports advanced techniques for automatic content categorization like vector-based taxonomies, concept modeling, and entity extraction"
The Forrester WaveTM Enterprise Search Platforms, Matthew Brown

Scalability and Performance

The management of structured and unstructured content requires a platform that can meet the most rigorous performance requirements and be easily resized commensurate to business needs. IDOL scales to support the largest enterprise-wide and portal deployments in the world, with presence in virtually every vertical market. Since IDOL's scalability is based on its modular, distributed architecture, it can handle massive amounts of data on commodity dual-CPU servers. For instance, only a few-hundred entry-level enterprise machines are required to support ChoicePoint's 10 billion record footprint. By comparison a competitor uses 150,000 machines to handle the same amount of data.

A single IDOL engine can:

Support over 470 million documents on 64-bit platforms
Accurately index in excess of 110 GB/hour with guaranteed index commit times (i.e. how fast an asset can be queried after it is indexed) of sub 5ms
Execute over 2,600 queries per second, with subsecond response times on a single machine with two CPUs when used against 70 million pieces of content, while querying the entire index for relevant information
Support hundreds of thousands of enterprise users, or millions of web users, accessing hundreds of terabytes of data
Save storage space with an overall footprint of less than 15% of the original file size

This enhanced scalability results in hardware cost-savings as well as the ability to address larger volumes of content. Though IDOL scales extremely well on commodity servers, its flexible architecture can take full advantage of massive parallelism, SMP processing capabilities, 64-bit environments (such as Intel Itanium 64-bit architecture), software platforms (such as Solaris 10, Linux 64, Win64, etc), distributed server farms, and all common forms of external disk arrays (i.e. NAS, SAN etc) to further improve performance. This flexibility extends to being able to leverage one or a combination of these different environments.

How It Works

Content from various repositories is aggregated by connectors and then indexed into the IDOL Server or for dissemination across multiple IDOL Servers, through the Distributed Index Handler (DIH). The DIH can efficiently split and index copious quantities of data into multiple IDOL Server instances, optimizing performance by batching data, replicating all index commands and invoking dynamic load distribution. The DIH can perform data-dependent operations, such as distributing the content by date, which allows for more efficient querying. Performance is augmented by the Distributed Action Handler (DAH), a distribution server that allows the user to distribute action commands, such as querying, to IDOL Servers. Multiple copies of IDOL Servers to which the DAH propagates actions further ensure uninterrupted service in the event of server failure. For flexibility, both the DAH and the DIH can be configured to run in mirroring mode (IDOL Servers are exact copies of each other) and non-mirroring mode (each IDOL Server is configured differently and contains different data). In addition, the Distributed Service Handler (DiSH) component allows effective auditing, monitoring and alerting of all other Autonomy components.

Linear Scalability

Performance and capacity can be doubled by simply replicating the existing machine. This allows scaling predictions to be made without worrying about bottlenecks.

Load Balancing

Data is automatically replicated across multiple servers and user requests are load-balanced across these replicas, guaranteeing performance, reducing latency and improving user-experience.

Mirroring / Failover

Automatically generated replicas are used to provide a pool of servers, the primary resource is automatically selected and the system switches to secondary systems if it fails so that service continues uninterrupted.

Distribution

For organizations that are geographically distributed, local replicas are automatically created and utilized where possible. Remote copies are only used when a local system fails, thereby building fault tolerance, the benefits of local performance and a reduction of resource overhead into a single, seamless service.

Adaptive Probabilistic Concept Caching

Frequently used concepts are maintained in memory and query results are returned as quickly and efficiently as possible.

Multi-dimensional Index & Query Throttling

By using a multi-dimensional index to provide valuable information to the distribution components, IDOL precludes bottlenecks and unbalanced peak loads during the indexing and query process.

Autonomy provides prioritized throttling based on:

Time: maximize index/query performance based on the time of day (i.e. work hours)
Location: prioritize activity based on the server landscape
Status: arbitrarily assign prioritized status for processing
"We have worked with Autonomy for a number of years due to their ability to offer a next-generation enterprise search platform that doesn't necessitate a trade-off between performance, security and scalability."
Mr. K. Sriram, Senior Vice President, Satyam Consulting and Enterprise Solutions Practice, 2007

Intelligent Repository Storage

Autonomy allows for the scheduled or discrete transfer of assets from higher stages (powerful, expensive hardware) to lower stages (cost-effective hardware) as the importance of assets declines over time. This potentially maximizes the value of information by making it available through more adequate hardware for a period of time, then again maximizing the existing investment in hardware by archiving information to machines better suited to that task.

Context and Concept Sensitive Indexing

Autonomy prioritizes the indexing queue using all contextual information, including user profiles and their usage of content. IDOL is aware of where the most important content is stored based on intelligent indexing, and performance is optimized at querying time.

Dynamic Storage Structures and Resource Sensitive Mode

Autonomy's distribution components are self-moderating and automatically re-distributes components based on peak hours and server. As an example, the DIH would not add new content to a full server but only send updates.

Advanced Data Transmission

In the information management context, data such as queries and results are travelling around the enterprise infrastructure. This information can impact the transport layers in which it rides. Recognizing this fact, and in an effort to save bandwidth, Autonomy has developed several mechanisms to reduce and minimize the impact on the network, one of which being a proprietary compression algorithm that reduces even further IDOL's low hardware requirements.

Distributed Link

Autonomy optimizes the streaming of rich media assets over high latency links, removing requirements for high performing speed and bandwidth.

Distributed Hot Deployment Mode

Hot deployment modes within the IDOL management units and server infrastructure allow software to be deployed in live environments and new requirements for areas such as functionality, resilience and scalability to be met without any disruption or downtime.

Distributed Abridged Mode

Autonomy ensures that the size of the installation does not impact performance or index cost. Autonomy's Distributed Abridge Modes apply to systems in which result sets of several million documents are routinely returned as part of a query. These modes allow for the overall size of the data that is returned across the network or transferred between components of the distributed system to be compressed and reduced to a minimum expression, achieving reduced transport and latency.

There do not seem to be any upcoming events related to this page at the moment, please visit our seminar and webinar pages for more information.

There do not seem to be any case studies related to this page available yet, however please visit our publications site at http://publications.autonomy.com/ for more information.

Products Security | Scalability and Performance | Global Language Support
+1 415 243 9955
Further References:
Discover More...

Company
Technology
Products
Functionality
Business Solutions
Services
Customers
Partners & OEMs
News & Events
Investors