An information retrieval model based on interaction features and neural networks / Fadel Alhassan

Fadel , Alhassan (2019) An information retrieval model based on interaction features and neural networks / Fadel Alhassan. Masters thesis, University of Malaya.

[img] PDF (The Candidate's Agreement)
Restricted to Repository staff only

Download (212Kb)
    PDF (Thesis M.A)
    Download (900Kb) | Preview


      As the state-of-the-art for ad-hoc retrieval, the interaction-based approach represents the interaction between the query and the document through the semantic similarities of their words. The constructed interaction structure is then passed into a deep learning model for feature extraction which in turn are passed into another deep learning model for textual documents ranking. As far as we know, no study has yet identified how relevance matches may appear in the interaction structure and what features reflect that matches. Instead, the majority of the proposed models are based on the hypothesis that relevance matches are following some fixed visual patterns in the interaction matrix. Therefore, most of them are utilizing deep learning techniques for visual pattern recognition for features extraction. This features extraction approach affects the proposed models’ performance and simplicity. This work starts with an analytical study to identify a set of features called the interaction features which reflect how relevance matches may appear in the interaction matrix. Accordingly, a new approach for features extraction and documents ranking is proposed. Interestingly, the study found that the interaction features do not follow any specific visual pattern and therefore it suggests that deep learning techniques are not the most effective approach for the feature extraction task. Instead, a set of manually designed functions are proposed and a shallow neural ranking model was developed. The experiments results confirm the previous finding and show that, though less complex and more efficient, our model was able to outperform two baselines and give a close performance to the state-of-the-art model even without using some important IR factors like term importance.

      Item Type: Thesis (Masters)
      Additional Information: Dissertation (M.A.) – Faculty of Computer Science & Information Technology, University of Malaya, 2019.
      Uncontrolled Keywords: Information retrieval model; Interaction structure; Neural networks; IR factors; Extraction task
      Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
      Divisions: Faculty of Computer Science & Information Technology
      Depositing User: Mr Mohd Safri Tahir
      Date Deposited: 06 Jan 2020 01:31
      Last Modified: 18 Jan 2020 10:35

      Actions (For repository staff only : Login required)

      View Item