Wednesday, October 17 • 4:15pm - 4:55pm
Embracing Diversity: Searching over Multiple Languages

Although much online content is written in English, there are many non-English-speaking users who still need to retrieve information. When searching, especially on tech-related topics, it's common to compose queries in English; however, such users may prefer search results written in their native language.

We'll see how statistical machine translation tools can help in this scenario by translating text at query time, improving recall and precision for search engine queries.

We'll look at how cross-language information retrieval can be implemented on top of Apache Solr with the help of a neural machine translation toolkit, and how Pointer-Generator Networks can be leveraged to summarize the retrieved and translated results from different sources.
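
As a rough illustration of query-time translation in front of Solr, the sketch below expands one English query into per-language clauses and builds the request parameters. It is not the speakers' implementation: the toy lookup table stands in for a real machine translation model, and the field names (text_en, text_de, text_fr), the core name "multilingual", and the local URL are all assumptions made for the example.

```python
from urllib.parse import urlencode

# Hypothetical stand-in for a machine translation model: a tiny lookup table.
# A real system would call an NMT toolkit here instead.
TOY_TRANSLATIONS = {
    ("en", "de"): {"search": "suche", "engine": "maschine"},
    ("en", "fr"): {"search": "recherche", "engine": "moteur"},
}


def translate(query, source, target):
    """Translate word by word with the toy table; unknown words pass through."""
    if source == target:
        return query
    table = TOY_TRANSLATIONS.get((source, target), {})
    return " ".join(table.get(word, word) for word in query.lower().split())


def build_solr_params(query, source="en", targets=("en", "de", "fr")):
    """Build select-handler parameters with one OR'ed clause per language.

    Field names such as text_en / text_de are assumed, not taken from the talk.
    """
    clauses = []
    for lang in targets:
        translated = translate(query, source, lang)
        clauses.append(f"text_{lang}:({translated})")
    return {"q": " OR ".join(clauses), "rows": 10, "wt": "json"}


params = build_solr_params("search engine")
print(params["q"])
# Assumed local Solr core named "multilingual"; no request is actually sent.
print("http://localhost:8983/solr/multilingual/select?" + urlencode(params))
```

Translating the query rather than the documents keeps the index unchanged, at the cost of one translation call per target language for every search.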

The audience will gain a better understanding of how to run search queries against a multilingual corpus indexed into Apache Solr and retrieve all of the relevant search results in different languages.

Suneel Marthi

Suneel is a Member of the Apache Software Foundation and a committer and PMC member on Apache Mahout, Apache OpenNLP, and Apache Streams. He has presented at Flink Forward, Hadoop Summit, Berlin Buzzwords, Machine Learning Conference, Big Data Tech Warsaw, and Apache Big Data.

Jeff Zemerick

Cloud Architect, Mountain Fog
Jeff is a software engineer and cloud architect. He is a committer and PMC member on Apache OpenNLP. Jeff currently works on natural language processing pipeline projects and resides outside of Morgantown, WV.

Wednesday October 17, 2018 4:15pm - 4:55pm EDT
Drummond East