ACM Conference on Web Search and Data Mining (WSDM-2019)

Nisarg Jhaveri, working under the supervision of Prof. Vasudeva Varma, presented a paper titled "clstk: The Cross-Lingual Summarization Tool-Kit" at the 12th ACM Conference on Web Search and Data Mining (WSDM-2019), held from 11 to 15 February 2019 in Melbourne, Australia. The authors of the paper are Nisarg Jhaveri, Manish Gupta (Microsoft, Hyderabad) and Prof. Vasudeva Varma.

Cross-lingual summarization (CLS) creates summaries in a target language from a document or document set given in a different source language. Cross-lingual summarization can play a critical role in enabling cross-lingual information access for millions of people across the globe who do not speak or understand the languages that have a large representation on the web. It can also make documents originally published in local languages quickly accessible to a large audience that does not understand those local languages. Though cross-lingual summarization has attracted some attention in the last decade, there has been no serious effort to publish rigorous software for this task. In this paper, we provide a design for an end-to-end CLS software package called clstk. Besides implementing a number of methods proposed by different CLS researchers over the years, the software integrates multiple components critical for CLS. It is hoped that this modular tool-kit will help CLS researchers contribute more effectively to the area.

WSDM (pronounced “wisdom”) is one of the premier conferences on web-inspired research involving search and data mining. WSDM is a highly selective conference that includes invited talks, as well as refereed full papers. WSDM publishes original, high-quality papers related to search and data mining on the Web and the Social Web, with an emphasis on practical yet principled novel models of search and data mining, algorithm design and analysis, economic implications, and in-depth experimental analysis of accuracy and performance.