Privacy Overview
This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Always Active
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.

No cookies to display.

Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.

No cookies to display.

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.

No cookies to display.

Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.

No cookies to display.

Other uncategorized cookies are those that are being analyzed and have not been classified into a category as yet.

No cookies to display.

Information Retrieval – A Multilingual Perspective

Since the advent of the World Wide Web (WWW), Information Retrieval (or IR, of which design of search algorithms and search engines is a major topic) has gained attention and popularity. One major impact of the WWW is that users have fundamentally changed their previous information seeking expectations and behavior: from “going to a library”, to expecting “the library comes to you”. A consequence is that foreign language resources of any country are often and easily available at one’s desktop. How does IR handle the situation?

This talk will give a brief history of IR research and describe how IR evolves to handle some of the Asian languages such as Chinese/Japanese/Korean for native speakers. For many studies or activities such as commerce, foreign affairs, science & technology, etc., westerners’ also need access to these foreign resources. To address this issue, IR has developed into CLIR – Cross-Lingual IR – by combining with automatic machine translation (MT). For example, users can input English queries to search and access Chinese documents efficiently and effectively. IR has been recognized as an important productivity enhancing tool since the web became popular in society, and CLIR is also considered significant for bridging the language gap between peoples.

Author Bio

Kui-Lam Kwok, a Professor of Computer Science at Queens College, got his B.Sc. from Hong Kong University and Ph.D. from Manchester University, England. His research interest is in Information Retrieval (IR) and associated topics such as automatic indexing, retrieval models, search methodologies. His work also includes translation/transliteration for cross language retrieval (CLIR): i.e. posing queries in one language to search documents in another – in particular for the English/Chinese language pair. IR has been recognized as an important productivity enhancement tool since web searching became popular in society, and CLIR is also considered highly significant for bridging the language gap between peoples.

About a decade and a half ago, the National Institute of Standards and Technology (NIST) has recognized the importance of IR, and designed "blind" retrieval environments called TREC (Text Retrieval Conference) for participants worldwide to experiment with their systems and algorithms in order to push IR technology forward. "Blind" means that before NIST's evaluation announcement in the annual TREC conference, no one knows the retrieval results. Prof. Kwok and his group participated in TREC continuously in the past years using his in-house developed PIRCS retrieval algorithm. New indexing techniques and retrieval approaches were researched and designed. These led to his participation returning top or near top automatic results in many years among many well known groups (see http://trec.nist.gov). PIRCS has been extended to support Chinese monolingual retrieval, which also won top submissions in TREC5 and 6.

In the past five years, the National Institute of Informatics (NII) at Tokyo has taken responsibility for Asian language retrieval from TREC, and initiated similar conferences called NTCIR (http://research.nii.ac.jp/ntcir-ws2 to ws4). Kwok and his group also participated in the last three (NTCIR-2 to 4), studying translation techniques and their influence on retrieval. They also returned top or near-top results for Chinese monolingual and English/Chinese CLIR among Asian participants.

Prof. Kwok's success in IR research and development has attracted over $1.5 million funding by U.S. government agencies and programs since 1992.