Activate 2018 has ended
Wednesday, October 17 • 4:15pm - 4:55pm
Automatically Build Solr Synonyms List Using Machine Learning

Synonym lists play an important part in search. However, it usually takes a company's search or ontology group a long time to detect and maintain synonyms. In this talk, we will discuss how to automatically detect synonyms from user click data and compare this approach with popular methods such as word2vec (which can find related words that are not necessarily interchangeable for search purposes). We will also demo how to generate these analytical results and use them to improve search relevancy with a system that combines the power of Solr with a fast distributed compute engine such as Apache Spark, bringing data science into production.

Chao Han

VP of Research, Lucidworks
Chao is a data scientist with over 10 years of analytical experience in both academia and industry. She earned a PhD in Statistics from Virginia Tech in 2012 (with 8 publications). After graduation, she worked at JPMorgan Chase R&D supporting projects in the areas of transaction text...

Wednesday October 17, 2018 4:15pm - 4:55pm EDT
Salon 4&5