International Journal of Science and Research Archive
International, peer-reviewed, open-access journal. ISSN: 2582-8185



Efficient resource allocation for generative AI workloads in cloud-native infrastructures: A multi-tiered approach


Kiran Randhi 1, * and Srinivas Reddy Bandarapu 2

1 Principal Solutions Architect.
2 Principal Cloud Architect.

Research Article
 

International Journal of Science and Research Archive, 2024, 13(02), 826-839.
Article DOI: 10.30574/ijsra.2024.13.2.2208
DOI url: https://doi.org/10.30574/ijsra.2024.13.2.2208

Received on 07 October 2024; revised on 12 November 2024; accepted on 15 November 2024

Effective resource management is essential for generative AI workloads in cloud-native infrastructures to perform well. This article describes an architecture aimed at such workloads, whose resource usage fluctuates and which are difficult to scale. The proposed framework divides resources into groups (tiers) so that each application receives support matched to its level of difficulty. The methodology includes a performance assessment of how effectively resources are distributed, using metrics including latency, throughput, and utilization rates. Examples are provided to illustrate the approach and its efficiency in real-life situations. Based on these, the multi-tiered approach to resource management improves an organization's operational performance and reduces the expenses associated with resource provisioning. The study also emphasizes the importance of developing flexible and effective resource management tools, which can be especially useful in modern generative AI development environments.

Generative AI; Resource Allocation; Cloud-Native Infrastructure; Multi-Tiered Approach; Performance Metrics
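
The abstract does not give the framework's details, so the following is only a minimal sketch of what a tiered allocation might look like, not the authors' method. The tier names, the complexity thresholds, the share of the GPU pool per tier, and the `Workload`, `assign_tier`, `allocate`, and `utilization` names are all assumptions made for illustration.

```python
from dataclasses import dataclass

# Hypothetical tiers and their share of the total GPU pool.
# These values are illustrative assumptions, not taken from the article.
TIER_SHARES = {"high": 0.6, "medium": 0.3, "low": 0.1}


@dataclass
class Workload:
    name: str
    complexity: float  # assumed difficulty score in [0, 1]


def assign_tier(workload: Workload) -> str:
    """Map a workload to a tier using assumed complexity thresholds."""
    if workload.complexity >= 0.7:
        return "high"
    if workload.complexity >= 0.3:
        return "medium"
    return "low"


def allocate(workloads: list[Workload], total_gpus: float) -> dict[str, float]:
    """Split each tier's GPU pool evenly among the workloads in that tier."""
    tiers: dict[str, list[Workload]] = {tier: [] for tier in TIER_SHARES}
    for workload in workloads:
        tiers[assign_tier(workload)].append(workload)

    plan: dict[str, float] = {}
    for tier, members in tiers.items():
        if not members:
            continue  # an empty tier leaves its pool unassigned in this sketch
        pool = total_gpus * TIER_SHARES[tier]
        for workload in members:
            plan[workload.name] = pool / len(members)
    return plan


def utilization(used: float, allocated: float) -> float:
    """Utilization rate: fraction of the allocated resource actually used."""
    return used / allocated if allocated else 0.0


if __name__ == "__main__":
    jobs = [
        Workload("llm-train", 0.9),
        Workload("img-gen", 0.5),
        Workload("embed", 0.1),
        Workload("chat", 0.2),
    ]
    for name, gpus in allocate(jobs, total_gpus=100).items():
        print(f"{name}: {gpus:.1f} GPUs")
```

A real scheduler would also have to react to the fluctuating demand the abstract mentions, for example by recomputing the plan as latency, throughput, and utilization measurements change; this sketch allocates once from static scores.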

https://ijsra.net/sites/default/files/fulltext_pdf/IJSRA-2024-2208.pdf


Kiran Randhi and Srinivas Reddy Bandarapu. Efficient resource allocation for generative AI workloads in cloud-native infrastructures: A multi-tiered approach. International Journal of Science and Research Archive, 2024, 13(02), 826-839. https://doi.org/10.30574/ijsra.2024.13.2.2208

Copyright © Author(s). All rights reserved. This article is published under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, sharing, adaptation, distribution, and reproduction in any medium or format, as long as appropriate credit is given to the original author(s) and source, a link to the license is provided, and any changes made are indicated.


All statements, opinions, and data contained in this publication are solely those of the individual author(s) and contributor(s). The journal, editors, reviewers, and publisher disclaim any responsibility or liability for the content, including accuracy, completeness, or any consequences arising from its use.
