Does WebSolr Support Asian Language Data?

Jim Leary's Avatar

Jim Leary

19 Jan, 2012 06:26 PM via web

We have a customer looking to use WebSolr and CloudBees with Asian Language data. Does WebSolr support it?

Thanks.

  1. Support Staff 2 Posted by Nick Zadrozny on 19 Jan, 2012 07:43 PM

    Nick Zadrozny's Avatar

    Hi Jim,

    Good question.

    We offer a fairly standard Solr install, which does have some capacity for
    useful Asian language analysis. For more advanced analysis, we are
    currently evaluating two additional packages for better Chinese analysis
    (Paoding, mmseg4j).

    In general, indexing and analyzing Asian language data is a relatively
    advanced subject, and we encourage our customers to do some research on
    Solr's capacities.

    If you like, we can also provide a recommendation for a consulting shop
    that specializes in Solr, has the necessary experience with Asian language
    indexing, and is familiar with our systems.

    —Nick

  2. Support Staff 3 Posted by Nick Zadrozny on 19 Jan, 2012 07:51 PM

    Nick Zadrozny's Avatar

    I should also mention, Solr by default includes Lucene's CJK analyzer
    library, and it would be worth reading up on its capabilities.

    —Nick

  3. Nick Zadrozny closed this discussion on 23 Feb, 2012 09:17 PM.

Comments are currently closed for this discussion. You can start a new one.