Increasingly, large enterprises are discovering that Apache Lucene and Apache Solr can help them dramatically reduce the costs of their most challenging indexing and search applications. Now, for the first time, there's a hands-on guide to using these technologies in enterprise environments. Packed with real-world examples and new best practices, Enterprise Lucene and Solr goes far beyond simply "getting started" to offer deep practical insights on planning, developing, and deploying highly efficient solutions. This is a practical, hands-on book. It is designed for people using Lucene and Solr in real-world, advanced applications. It covers topics such as enterprise search, spatial search, machine learning (specifically, text classification), and Big Data search. There is also a whole section devoted to what the author calls "productionalizing" Lucene and Solr, which will show you how to intelligently develop, deploy, and maintain production installations. This book is divided into four parts. Part I covers Lucene: one chapter on the basics, then two more covering indexing and querying in depth, with hands-on examples in between. Part II provides a similar foundation for Solr. Part III is where the fun begins, because we get to build some interesting applications that demonstrate the power of these technologies. And finally, Part IV covers the bit that many people forget about: how to use these technologies in production.
This book is for developers who want to learn how to get the most out of Solr in their applications, whether you are new to the field, have used Solr but don't know everything, or simply want a good reference. It would be helpful to have some familiarity with basic programming concepts, but no prior experience is required.
The book is written as a reference guide. It includes fully working examples based on a real-world public data set. This book is for developers who want to learn how to use Apache Solr in their applications. Only basic programming skills are needed.
Is your organization rapidly accumulating more information than you know how to manage? This updated edition of Enterprise Search helps you create an enterprise search solution based on more than just technology. Author Martin White shows you how to plan and implement a managed search environment that meets the needs of your business and your employees. You'll learn why it’s absolutely vital to have a dedicated staff manage your search technology and support your users. New material for this second edition covers SharePoint 2013 search, managing open source search development, website search, designing the search user interface, and assessing search performance. Chapters now include a Further Reading section for computer science and information science students. Topics include: 10 critical success factors to assess organizational search maturity; essential skills needed to support a successful search application; how to specify and manage open source search development; how to manage SharePoint 2013 search; methods to assess the business impact of search; best practices in user interface design; the importance of search for websites; and what to include in a search strategy.
Build an enterprise search engine using Apache Solr: index and search documents; ingest data from varied sources; apply various text processing techniques; utilize different search capabilities; and customize Solr to retrieve the desired results. Apache Solr: A Practical Approach to Enterprise Search explains each essential concept, backed by practical and industry examples, to help you attain expert-level knowledge. The book, which assumes a basic knowledge of Java, starts with an introduction to Solr, followed by steps for setting it up, indexing your first set of documents, and searching them. It then introduces you to information retrieval and its implementation in Apache Solr; this will help you understand your search problem, decide on an approach to building an effective solution, and use various metrics to evaluate the results. The book next covers schema design and techniques for building a text analysis chain that cleanses, normalizes, and enriches your documents and addresses different types of search queries. It describes popular matching techniques that are generally applied to improve the precision and recall of searches. You will learn the end-to-end process of data ingestion from varied sources, metadata extraction, pre-processing and transformation of content, various search components, query parsers, and other advanced search capabilities. After covering out-of-the-box features, Solr expert Dikshant Shahi dives into ways you can customize Solr for your business and its specific requirements, along with ways to plug in your own components. Most important, you will learn about implementations for Solr scoring, factors affecting the document score, and tuning the score for the application at hand. The book explains why textual scoring is not sufficient for practical ranking of documents and shows ways to integrate real-world factors into document ranking.
You'll see how to influence user experience by providing suggestions and recommendations. You'll also see integration of Solr with important related technologies such as OpenNLP and Tika. Additionally, you will learn about scaling Solr using SolrCloud. This book concludes with coverage of semantic search capabilities, which are crucial for taking the search experience to the next level. By the end of Apache Solr, you will be proficient in designing and developing your search engine.
With the intense interest in big data and the growing complexity of Apache Solr applications, application developers, business professionals, and end-users alike are clamoring for a more in-depth look at Apache Lucene and Solr. This comprehensive one-stop guide helps you gain a thorough understanding of Lucene’s underlying architecture so you can design, implement, and tune successful Solr applications. High-speed inverted indexes are inherently difficult to develop. That’s why more and more enterprises are implementing the Solr search server and Lucene Core search technology for complex text retrieval, as a NoSQL system for big data, or as a replacement for relational database systems that require horizontal scalability. With this guide’s complete coverage of both Lucene and Solr, you’ll get a unified view of their value and applicability to your big data projects. Learn how Lucene works from the inside out. Get examples for using both Lucene and Solr APIs. Configure Solr for optimal production use. Learn how to use Solr with Hadoop.
Successfully architect a Drupal 8 website that scales to meet project requirements of any size and scope. Starting with a one-chapter review of Drupal basics, you'll dive into deeper topics including software development processes, architecting a Drupal site, scaling Drupal, working effectively with themes, and more. In addition to a thorough discussion of custom module development and how to develop modules as building blocks, you'll also review many common ways of integrating Drupal with other 3rd party systems. Building and maintaining an enterprise-ready Drupal website presents a unique set of challenges and complexities. From development processes and content management to deployments, version control and more, all aspects of Drupal development are impacted when two or more developers are on a project. If you are involved in a Drupal project that requires frequent updates and long-term support from a team of developers, system administrators, and end-users, Enterprise Drupal 8 Development is for you. Whether you're an experienced Drupal developer looking to expand your skills, a systems administrator managing a Drupal project, or a PHP developer new to Drupal, Enterprise Drupal 8 Development will give you the knowledge and inspiration you need to manage large and complex projects. What You'll Learn: architect Drupal for development teams and apply best practices for large Drupal projects; integrate 3rd party applications with Drupal; manage content, code, and configuration across multiple staging environments; build a consistent user experience for administrators and content creators. Who This Book Is For: the primary audience is intermediate to advanced Drupal developers who are working on large-scale Drupal projects and advanced PHP developers new to Drupal. The secondary audience is IT professionals such as systems administrators managing Drupal projects.
Solr (pronounced "solar") is an open source enterprise search platform, written in Java, from the Apache Lucene project. Its major features include full-text search, hit highlighting, faceted search, real-time indexing, dynamic clustering, database integration, NoSQL features, and rich document (e.g., Word, PDF) handling. Providing distributed search and index replication, Solr is designed for scalability and fault tolerance. Solr is the second-most popular enterprise search engine after Elasticsearch. Solr runs as a standalone full-text search server. It uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it usable from most popular programming languages. Solr's external configuration allows it to be tailored to many types of application without Java coding, and it has a plugin architecture to support more advanced customization. This updated and expanded second edition of the book provides a user-friendly introduction to the subject. Taking a clear structural framework, it guides the reader through the subject's core elements. A flowing writing style combines with the use of illustrations and diagrams throughout the text to ensure the reader understands even the most complex of concepts. This succinct and enlightening overview is required reading for all those interested in the subject. We hope you find this book useful in shaping your future career and business.
This book is aimed at developers, designers, and architects who would like to build big data enterprise search solutions for their customers or organizations. No prior knowledge of Apache Hadoop and Apache Solr/Lucene technologies is required.