Home
International Journal of Science and Research Archive
International, Peer reviewed, Open access Journal ISSN Approved Journal No. 2582-8185

Main navigation

  • Home
    • Journal Information
    • Abstracting and Indexing
    • Editorial Board Members
    • Reviewer Panel
    • Journal Policies
    • IJSRA CrossMark Policy
    • Publication Ethics
    • Issue in Progress
    • Current Issue
    • Past Issues
    • Instructions for Authors
    • Article processing fee
    • Track Manuscript Status
    • Get Publication Certificate
    • Become a Reviewer panel member
    • Join as Editorial Board Member
  • Contact us
  • Downloads

ISSN Approved Journal || eISSN: 2582-8185 || CODEN: IJSRO2 || Impact Factor 8.2 || Google Scholar and CrossRef Indexed

Peer Reviewed and Referred Journal || Free Certificate of Publication

Research and review articles are invited for publication in March 2026 (Volume 18, Issue 3) Submit manuscript

Stacked ensemble improvement of phishing Email corpus detection based on frequency-based count vector embedding

Breadcrumb

  • Home
  • Stacked ensemble improvement of phishing Email corpus detection based on frequency-based count vector embedding

Olayemi Olasehinde 1, Olayemi Olufunke Catherine 2 and Peter Adetola Adetunji 3, *

1 Department of Computing and Engineering, University of Huddersfield, UK.
2 Department of Computing and Games, Teesside University, Middlesborough, UK.

Research Article
 

International Journal of Science and Research Archive, 2024,13(02), 3774-3788.
Article DOI: 10.30574/ijsra.2024.13.2.1830
DOI url: https://doi.org/10.30574/ijsra.2024.13.2.1830

Received on 18 November 2024; revised on 26 December 2024; accepted on 28 December 2024

Email users are at risk from phishing attacks, which utilize a combination of technological and social engineering techniques to obtain sensitive information from targets and cause significant financial loss. It is the fastest-rising online crime for stealing personal and financial data. In this work, natural language processing was applied to process an unstructured email corpus and convert it to a word vector matrix suitable to build machine learning models implemented using the Python programming language. The test corpus was evaluated using the four base models, and the results indicate that the random forest model had the highest accuracy (92.71%), closely followed by the logistic regression model (89.01%), the Naive Bayes recoded model (83.52%), and the KNN model (79.95%) with the lowest accuracy. A notable improvement in classification accuracy and a decrease in the false alarm rate observed by all base models were demonstrated by the stacked ensemble evaluation of the base model predictions, which yielded an accuracy of 97.14%. It recorded a classification improvement of 21.5%, 5.4%, 16.3%, and 9.1% over the KNN, RF, NB, and LR models, respectively, and a drop in false alarm rate by 79.0%, 36.0%, 76.4%, and 64.0% over the KNN, RF, NB, and LR models, respectively. The implementation of this approach on the mail server to filter incoming phished emails.

Identity theft; Corpus Embedding; Phishing Detection; Meta-Learners

https://ijsra.net/sites/default/files/fulltext_pdf/IJSRA-2024-1830.pdf

Preview Article PDF

Olayemi Olasehinde, Olayemi Olufunke Catherine and Peter Adetola Adetunji. Stacked ensemble improvement of phishing Email corpus detection based on frequency-based count vector embedding. International Journal of Science and Research Archive, 2024,13(02), 3774-3788. https://doi.org/10.30574/ijsra.2024.13.2.1830

Copyright © Author(s). All rights reserved. This article is published under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the original author(s) and source, a link to the license is provided, and any changes made are indicated.


All statements, opinions, and data contained in this publication are solely those of the individual author(s) and contributor(s). The journal, editors, reviewers, and publisher disclaim any responsibility or liability for the content, including accuracy, completeness, or any consequences arising from its use.

Get Certificates

Get Publication Certificate

Download LoA

Check Corssref DOI details

Issue details

Issue Cover Page

Editorial Board

Table of content

          

   

Copyright © 2026 International Journal of Science and Research Archive - All rights reserved

Developed & Designed by VS Infosolution