Does WebSolr Support Asian Language Data?

Jim Leary

19 Jan, 2012 06:26 PM via web

We have a customer who wants to use WebSolr and CloudBees with Asian-language data. Does WebSolr support it?

Thanks.

  1. Support Staff 2 Posted by Nick Zadrozny on 19 Jan, 2012 07:43 PM

    Hi Jim,

    Good question.

    We offer a fairly standard Solr install, which has some built-in support
    for Asian-language analysis. For more advanced analysis, we are currently
    evaluating two additional packages for better Chinese analysis
    (Paoding and mmseg4j).

    In general, indexing and analyzing Asian-language data is a relatively
    advanced subject, and we encourage our customers to do some research on
    Solr's capabilities in this area.

    If you like, we can also provide a recommendation for a consulting shop
    that specializes in Solr, has the necessary experience with Asian language
    indexing, and is familiar with our systems.

    —Nick

  2. Support Staff 3 Posted by Nick Zadrozny on 19 Jan, 2012 07:51 PM

    I should also mention that Solr includes Lucene's CJK (Chinese, Japanese,
    Korean) analyzer by default, and it would be worth reading up on its
    capabilities.

    —Nick
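
For readers unfamiliar with the CJK analyzer mentioned above: its core idea is to index runs of CJK characters as overlapping two-character tokens (bigrams), because these languages do not separate words with spaces. The Python sketch below is only a rough illustration of that idea, not Lucene's code. The function names `is_cjk` and `cjk_bigrams` are hypothetical, the character ranges are a simplified assumption, and non-CJK text is simply skipped here, whereas a real analyzer would tokenize it as well.

```python
from typing import List


def is_cjk(ch: str) -> bool:
    """Rough check (assumption): common ideographs, kana, and Hangul syllables."""
    cp = ord(ch)
    return (
        0x4E00 <= cp <= 0x9FFF      # CJK Unified Ideographs
        or 0x3040 <= cp <= 0x30FF   # Hiragana and Katakana
        or 0xAC00 <= cp <= 0xD7A3   # Hangul syllables
    )


def cjk_bigrams(text: str) -> List[str]:
    """Split each run of CJK characters into overlapping bigrams.

    A run of length one is emitted as a single-character token.
    Non-CJK characters only act as run separators in this sketch.
    """
    tokens: List[str] = []
    run: List[str] = []

    def flush() -> None:
        # Emit tokens for the current run, then reset it.
        if len(run) == 1:
            tokens.append(run[0])
        else:
            tokens.extend(run[i] + run[i + 1] for i in range(len(run) - 1))
        run.clear()

    for ch in text:
        if is_cjk(ch):
            run.append(ch)
        else:
            flush()
    flush()
    return tokens


if __name__ == "__main__":
    print(cjk_bigrams("中文搜索"))
```

Bigram indexing needs no dictionary, which is why it works as a default, but it can match character pairs that span word boundaries. Dictionary-based segmenters such as the Paoding and mmseg4j packages mentioned earlier aim to reduce that noise for Chinese.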
