LucidWorks | Technical Overview

For developers and architects of search applications, LucidWorks offers the open source power and flexibility of Apache Lucene/Solr without requiring you to become a Lucene/Solr expert first. As the world's leading experts in open source search, we bottle the very best that the community process has to offer, and then some.

Because it adds powerful, intuitive tools for configuration, deployment, content acquisition, security, and search experience – all in a convenient, well-organized package – LucidWorks simplifies and accelerates search application development, getting you faster, more secure access to your content. And thanks to the economies of open source, you can scale to billions without spending millions.

As a cutting-edge search platform that delivers scalable access to all data, big and small, LucidWorks lets you build the killer apps that transform the unending stream of data and content into versatile, actionable information.

Lucene/Solr 4.x, for power and innovation


  • Near-Real-Time Search: Flexible control to rapidly update and delete index data in segments; dramatically reduces the time to search-readiness for newly indexed documents
  • Massively Scalable through Distributed Indexing: Centralizes configuration and deployment of Solr clusters to deliver search integrity and consistency across all servers in the cluster, including load balancing and failover for query traffic, using Apache ZooKeeper to coordinate distributed instance configuration
  • Faster Fuzzy Queries: Provides an order-of-magnitude performance improvement by using advanced distance algorithms and finite state automata to narrow index scans, avoiding the need to scan large index ranges to identify matches
  • Complete, Integrated, Tested Apache Solr: Adds cutting-edge Lucene/Solr 4 innovations over and above the full feature set of Apache Lucene/Solr 3.x, which included sorting by function query, field collapsing, the extended dismax query parser, UIMA integration, and more
  • Open Source Transparency and Flexibility: Provides unmediated visibility into the construction of underlying search processes and algorithms, making control of search routines and results as transparent as possible
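
To make the "Faster Fuzzy Queries" item concrete, here is a minimal sketch of what matching "within N edits" means. It is not how Lucene/Solr 4 does it: the list above describes finite state automata that narrow the index scan, whereas this sketch swaps in a plain dynamic-programming Levenshtein distance checked against every term in a small list. The function names and the sample terms are assumptions made for illustration.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance using the classic two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute (or match)
        prev = curr
    return prev[-1]


def fuzzy_matches(query, terms, max_edits=2):
    """Brute-force scan: keep every term within max_edits of the query."""
    return sorted(t for t in terms if edit_distance(query, t) <= max_edits)


# Hypothetical term dictionary, for illustration only.
terms = ["lucene", "lucent", "lucid", "solr"]
matches = fuzzy_matches("lucene", terms, max_edits=2)
```

The brute-force scan touches every term, which is exactly the cost an automaton-based approach is meant to avoid on a large index.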

 

Streamlined Search Configuration, Deployment and Operations

  • Intuitive, simplified control of search quality and user behaviors: Admin UI delivers well-structured, streamlined control of configuration and tuning for crawlers and index fields, stopwords, boosting, stemming, field parameters and settings, faceting options, as well as user management, security, and user experience options
  • Search overview dashboard: Tracks query throughput, indexing throughput, most popular and most recent queries, and more
  • Ongoing search optimization, tuning and configuration: Inspects content and field types, to easily select search parameters and configuration options that deliver improved relevancy and user-visible search quality
  • REST API better automates integration of search as a service: Easily automates search operations, configuration and optimization through well-abstracted, lightweight interfaces
  • Keep pace with constant change: Sensitive configuration files are accessed through robust REST interfaces, to avoid introducing error conditions and misconfigurations that can interfere with search
  • Smart defaults for ease-of-use to reduce errors: Minimizes tedious, error-prone and labor intensive manual editing of text and XML configuration files
  • Standardized interfaces for monitoring: Monitoring API integrates with common commercial application management tools, as well as JMX and open source tools such as Zabbix and Nagios, delivering integrated application-level statistics and performance data for your search application
  • Proactive performance management: Configures parameters and thresholds as triggers to issue alerts for performance and other service-level issues
  • Built-in upgradeability: Automatically captures and redeploys critical schema and configuration data for Solr and LucidWorks to streamline and simplify upgrades, without incurring the cost of reconstructing indexes or intricate configurations and schemas
  • Built-in log search and indexing tools: Indexes its own search application logs, to simplify analysis and reporting on query and usage patterns, as well as to identify application errors, warnings, and job status or progress
  • Easy to install and deploy: Installer can get you searching in minutes; generates an automated installation script for playback on slave nodes in multi-server master/slave configurations
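
The "Proactive performance management" item above describes thresholds that act as alert triggers. The following is a minimal sketch of that idea only; it does not use the LucidWorks monitoring API, and the metric names (`avg_query_ms`, `docs_indexed_per_min`, `heap_used_pct`), the `Threshold` class, and the `check` function are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Threshold:
    metric: str          # hypothetical metric name
    limit: float
    above: bool = True   # True: alert when value > limit; False: when value < limit


def check(stats, thresholds):
    """Return one alert message per breached threshold."""
    alerts = []
    for t in thresholds:
        value = stats.get(t.metric)
        if value is None:
            continue  # metric not present in this sample
        breached = value > t.limit if t.above else value < t.limit
        if breached:
            direction = "above" if t.above else "below"
            alerts.append(f"{t.metric}={value} is {direction} {t.limit}")
    return alerts


# Made-up sample of statistics, for illustration only.
stats = {"avg_query_ms": 420.0, "docs_indexed_per_min": 50.0}
thresholds = [
    Threshold("avg_query_ms", 250.0),
    Threshold("docs_indexed_per_min", 100.0, above=False),
    Threshold("heap_used_pct", 90.0),
]
alerts = check(stats, thresholds)
```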

Broad-based content acquisition

  • Hadoop HDFS connector: Traverses HDFS file system hierarchy according to specified parameters and retrieves files for indexing, with support for HDFS permissions to enable robust, efficient search processing of files in the Hadoop cluster
  • Amazon S3 connector: Configures search to access resources in specified buckets under specific paths, enabling it to traverse, index and retrieve data and documents available based on AWS credentials
  • SharePoint connector: Crawls and indexes the content of a SharePoint server as well as Windows shares and the Access Control Lists (ACLs) associated with shared files and directories, including early binding of document security attributes
  • Easily indexes databases, XML, JSON, HTML, filesystems: Streamlines data/document acquisition pipeline to easily add new collections from any data source; easily set sources, schedules, credentials, indexing policies, permissions, and more
  • Split crawl enables iterative content processing: Decouple data access from parsing and from indexing; elements of the crawl sequence can each be used iteratively or in batch, without recurring full content processing costs; for example, a change in configuration file that triggers a change in index structure can use the output of earlier content parsing without crawling source data anew
  • Build custom connectors: APIs and Admin UI let you build your own custom connectors and publish their configuration, to search any of the data sources supported by Google Connector Manager, Aperture, and/or Apache Tika
  • Support for popular file formats: Index and search files from Microsoft Office, Adobe PDF, and other common formats; extract documents from local or remote disks, databases, and web sites, or use native XML format; additional options for file readers and document filters also available
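
The "Split crawl" item above can be sketched as three decoupled stages whose outputs are kept, so that a change in index configuration re-runs only the indexing stage. This is an illustrative toy under stated assumptions, not the LucidWorks implementation: the `SplitCrawl` class, the fake fetch and parse functions, and the field names are all hypothetical.

```python
class SplitCrawl:
    """Keeps each stage's output so later stages can be re-run on their own."""

    def __init__(self, fetch, parse):
        self._fetch = fetch   # callable: source id -> raw content
        self._parse = parse   # callable: raw content -> dict of fields
        self.raw = {}
        self.parsed = {}

    def crawl(self, sources):
        for source in sources:
            self.raw[source] = self._fetch(source)

    def parse_all(self):
        for source, content in self.raw.items():
            self.parsed[source] = self._parse(content)

    def build_index(self, fields):
        """Inverted index (term -> sorted sources) over the chosen fields."""
        index = {}
        for source, doc in self.parsed.items():
            for field in fields:
                for term in doc.get(field, "").lower().split():
                    index.setdefault(term, set()).add(source)
        return {term: sorted(hits) for term, hits in index.items()}


fetch_calls = []  # records every access to the (fake) source system


def fake_fetch(source):
    fetch_calls.append(source)
    return f"Title of {source}\nbody text for {source}"


def fake_parse(content):
    title, _, body = content.partition("\n")
    return {"title": title, "body": body}


pipeline = SplitCrawl(fake_fetch, fake_parse)
pipeline.crawl(["doc1", "doc2"])
pipeline.parse_all()
title_index = pipeline.build_index(["title"])
# Configuration change: index the body too, reusing the earlier parse output.
full_index = pipeline.build_index(["title", "body"])
```

After the second `build_index` call, `fetch_calls` still holds only the two original fetches: the source data was not crawled again.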

Versatile access and data security

  • End-to-end Security Policy Management: Integrated management of security including user and document controls, optional encrypted communications, and differentiated search and indexing, across different models such as LDAP and SharePoint ACLs
  • Source HTTP Authentication: Manage credentials centrally and securely from the Admin UI
  • Encrypted Communication Options: Encrypts both client-to-server and server-to-server communications between distributed indexing and query servers, and secures the replication process in distributed search setups
  • LDAP-aware: Support for integrating user authentication with an existing LDAP system, including authentication and authorization of users as well as user-to-group mapping
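
One common way to combine user-to-group mapping with ACLs stored at index time is to restrict each query with a Solr filter query (`fq`) built from the user's groups. The sketch below assumes that approach; it is not a description of how LucidWorks enforces security. The `acl` field name and the helper functions are hypothetical, and the group list would come from LDAP in a real deployment.

```python
from urllib.parse import urlencode


def quote_term(value):
    """Wrap a value in double quotes, escaping backslashes and quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def acl_filter(groups, field="acl"):
    """Build a filter matching documents readable by any of the groups.

    `field` is a hypothetical index field holding each document's ACL groups.
    """
    if not groups:
        raise ValueError("user has no groups; refusing to build an open filter")
    clauses = " OR ".join(quote_term(g) for g in sorted(set(groups)))
    return f"{field}:({clauses})"


def search_params(user_query, groups):
    """URL-encode the user's query plus the ACL filter as q and fq."""
    return urlencode({"q": user_query, "fq": acl_filter(groups)})


filter_query = acl_filter(["staff", "engineering", "staff"])
query_string = search_params("budget", ["staff"])
```

Raising an error for a user with no groups keeps the sketch from silently issuing an unrestricted search.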

Advanced Search Experience easily integrated into your application and infrastructure

  • REST APIs: Automates remote access and operation, exposing all LucidWorks functionality via API; programmatically manage tasks such as creating and managing data sources, content acquisition, setting field behaviors, monitoring search and infrastructure operations, and more, supporting both conventional on-premise deployment as well as cloud-based DevOps
  • Accelerate Development with Example Client Libraries across Platforms: Packaged examples make it easier to integrate applications written in different languages and environments with Solr, including .NET, Perl, and Python. The Java execution environment can be hosted on virtually any server environment or cloud infrastructure
  • Query Parsing Enhancements: Deliver a more resilient, richer user experience with new query operators and better stop word and synonym handling; provides a simpler, more forgiving, and intuitive end user search experience, improving odds of turning user inputs into quality results
  • Click Scoring Relevance Framework: Suite of components and tools for adjusting results ranking based on analysis of historical click-through log data; tracks which results were selected by users for individual queries, providing relevance boosting based on document popularity
  • User Alerts: Automated notifications keep users up to date as new search results are available; end users can select, define, and manage any valid search query as an alert
  • Integrated Spell Checking: Accelerate spellchecking and term validation with advanced edit distance algorithms
  • Integrated Auto-complete: Create unique auto-complete indexes specific to individual document collections to fine-tune user experience
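
The "Click Scoring Relevance Framework" item above describes boosting results by historical click-through data. As a rough sketch of the general idea, the code below counts clicks per query and document and multiplies each base score by a logarithmic boost. The formula, the `weight` parameter, and the function names are assumptions for illustration, not the framework's actual scoring.

```python
import math
from collections import Counter


def click_counts(click_log):
    """Count clicks per (query, doc_id) from a log of (query, doc_id) records."""
    return Counter(click_log)


def rerank(query, results, counts, weight=0.5):
    """Re-sort (doc_id, base_score) pairs by a click-boosted score.

    Hypothetical boost: score * (1 + weight * ln(1 + clicks)).
    """
    def boosted(item):
        doc_id, score = item
        return score * (1.0 + weight * math.log1p(counts[(query, doc_id)]))

    return [doc_id for doc_id, _ in sorted(results, key=boosted, reverse=True)]


# Made-up click log and base scores, for illustration only.
log = [("solr", "b")] * 5 + [("solr", "a")]
counts = click_counts(log)
results = [("a", 1.0), ("b", 0.9), ("c", 0.8)]
boosted_order = rerank("solr", results, counts)
unboosted_order = rerank("lucene", results, counts)  # no clicks recorded
```

The logarithm dampens the effect of very popular documents so that a large click count cannot completely override the base relevance score.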

 

 



Apache Solr, Solr, Apache Lucene, Lucene and their logos are trademarks of the Apache Software Foundation.

© 2012 Lucid Imagination. All rights reserved.