Extracting and searching news articles in web portal news pages
by Namyun Kim
International Journal of Computational Vision and Robotics (IJCVR), Vol. 10, No. 3, 2020

Abstract: Recently, a large amount of news articles is being created online, and news articles are important resources for understanding social phenomena and trends. Accordingly, a web portal service provides a 'portal news page' that classifies news articles published from various news sources into sections and provides each news article with a certain structure. Therefore, by analysing portal news pages, it is possible to automatically extract information about news articles. In this paper, we introduce a prototype that extracts and searches key information of news articles for analysis. Specifically, we describe: 1) a crawler that collects, analyses and parses news articles; 2) an Elasticsearch server that indexes and searches news information; and 3) a front-end application that provides a search user interface. These systems are expected to provide the foundation for news analytics and forecasting services.

Online publication date: Mon, 11-May-2020

The full text of this article is only available to individual subscribers or to users at subscribing institutions.

 
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.

Pay per view:
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.

Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Computational Vision and Robotics (IJCVR):
Login with your Inderscience username and password:

    Username:        Password:         

Forgotten your password?


Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.

If you still need assistance, please email subs@inderscience.com