A selective approach to stemming for minimizing the risk of failure in information retrieval systems

Date

2023

Author

Göksel, Gökhan
Arslan, Ahmet
Dinçer, Bekir Taner

Citation

Göksel G, Arslan A, Dinçer BT. 2023. A selective approach to stemming for minimizing the risk of failure in information retrieval systems. PeerJ Computer Science 9:e1175 https://doi.org/10.7717/peerj-cs.1175

Abstract

Stemming is supposed to improve the average performance of an information retrieval (IR) system, but past experimental results show that this is not always the case in practice. In this article, we propose a selective approach to stemming that decides, on a per-query basis, whether stemming should be applied. Our method aims to minimize the risk of failure caused by stemming when retrieving semantically related documents. The work contributes to the IR literature mainly by proposing an application of selective stemming and a set of new features derived from the term frequency distributions of the systems under selection. The method built on this approach combines some query performance predictors, the derived features, and a machine learning technique. It is comprehensively evaluated using three rule-based stemmers and eight query sets corresponding to four document collections from the standard TREC and NTCIR datasets. All but one of the document collections consist of Web documents and range in size from 25 million to 733 million documents. The results of the experiments show that the method is capable of making accurate selections that increase the robustness of the system and minimize the risk of failure (i.e., per-query performance losses) across queries. The results also show that the method attains a systematically higher average retrieval performance than the single systems for most query sets.
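
The per-query selection idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the threshold rule stands in for the machine learning technique, and the feature name `predicted_gain`, the score values, and the risk measure (mean per-query loss relative to the unstemmed system) are all hypothetical assumptions chosen for illustration.

```python
from statistics import mean

# Hypothetical toy data: per-query effectiveness scores for the unstemmed
# ("nostem") and stemmed ("stem") systems, plus one made-up feature.
QUERIES = [
    {"scores": {"nostem": 0.40, "stem": 0.50}, "features": {"predicted_gain": 0.2}},
    {"scores": {"nostem": 0.30, "stem": 0.10}, "features": {"predicted_gain": -0.3}},
    {"scores": {"nostem": 0.20, "stem": 0.15}, "features": {"predicted_gain": 0.1}},
]


def selective(features):
    """Stand-in selector (assumption): stem only when the predicted gain is positive."""
    return "stem" if features["predicted_gain"] > 0.0 else "nostem"


def always_stem(features):
    """Baseline policy: apply stemming to every query."""
    return "stem"


def evaluate(queries, policy):
    """Return (mean score, mean per-query loss relative to the unstemmed system)."""
    chosen_scores = []
    losses = []
    for q in queries:
        choice = policy(q["features"])
        score = q["scores"][choice]
        chosen_scores.append(score)
        # A loss is counted only when the chosen system does worse than no stemming.
        losses.append(max(0.0, q["scores"]["nostem"] - score))
    return mean(chosen_scores), mean(losses)


selective_mean, selective_risk = evaluate(QUERIES, selective)
stem_mean, stem_risk = evaluate(QUERIES, always_stem)
```

In this toy setting the selector makes one wrong choice (the third query) yet still incurs a smaller mean loss than always stemming, which is the kind of trade-off between average performance and per-query risk that the abstract refers to.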

Source

PeerJ Computer Science

URI

https://doi.org/10.7717/peerj-cs.1175
https://hdl.handle.net/20.500.12809/10795

Collections

  • Computer Engineering Department Collection [103]
  • WoS-Indexed Publications Collection [6466]



Creative Commons License
Muğla Sıtkı Koçman University Institutional Repository is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 Unported License.
