Real-Time Analytics: Implementing a Lambda Architecture on Hadoop - Part 2
HBase Lily Indexer: Indexing data from HBase to Solr by configuration

This is the second part of my three-part blog series on achieving real-time analytics capability. The focus of this post is indexing data from HBase into Solr through configuration alone, with very little development.

If you have a web or mobile app, it is nice to offer search over your data, and we use Solr to get fuzzy search capability. Since we already loaded the data into HBase as part of the ETL using Spark, a separate ETL process to load Solr is not necessary. The Lily Indexer indexes data that is added, updated, or deleted in HBase into a Solr collection, keeping the two in sync in near real time.

Indexing lets you query data stored in HBase through the Solr service. The indexer supports flexible, custom, application-specific rules to extract, transform, and load HBase data into Solr. Solr search results can contain columnFamily:qualifier links back to the data stored in HBase. ...
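To make the fuzzy search idea concrete, here is a minimal Python sketch of how a client might build a fuzzy query against a Solr collection once the data is indexed. The base URL, the collection name (`customers`), and the field name (`name`) are assumptions for illustration and are not taken from the pipeline in this series. The `~N` suffix is standard Solr/Lucene fuzzy syntax, where N is the maximum edit distance (0 to 2). The function only builds the request URL and does not contact Solr.

```python
from urllib.parse import urlencode


def build_fuzzy_query_url(base_url, collection, field, term, max_edits=1, rows=10):
    """Build a Solr /select URL for a fuzzy match on a single field.

    Assumes `term` is a single word without Lucene special characters;
    escaping of special characters is intentionally left out of this sketch.
    """
    # Lucene fuzzy queries accept an edit distance of 0, 1, or 2.
    if max_edits not in (0, 1, 2):
        raise ValueError("max_edits must be 0, 1, or 2")

    params = {
        "q": f"{field}:{term}~{max_edits}",  # e.g. name:jon~1
        "rows": rows,                        # number of results to return
        "wt": "json",                        # ask Solr for a JSON response
    }
    # urlencode percent-encodes reserved characters such as ':' in the query.
    return f"{base_url.rstrip('/')}/{collection}/select?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical host, collection, and field names.
    print(build_fuzzy_query_url("http://localhost:8983/solr", "customers", "name", "jon"))
```

With an edit distance of 1, a query for `jon` would also match indexed terms such as `john` or `jan`, which is the kind of tolerance a search box in a web or mobile app usually needs.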