Advanced Big Data Analytics: Integrating Fuzzy C-Means, Encoder-Decoder CNNs, and Genetic Algorithms for Efficient Clustering and Classification
Keywords:
Fuzzy C-Means (FCM) clustering, optimized Encoder-Decoder, classification, Genetic Algorithms in Clustering, Optimisation
Abstract
In the realm of Big Data analysis, the pivotal question of data clustering takes center stage. This study delves into optimizing this analysis by adopting a hybrid approach that integrates the Fuzzy C-Means (FCM) methodology, Encoder-Decoder Convolutional Neural Networks (CNN), Genetic Algorithms (GAs), and an optimal classification strategy for data clustering and categorization. FCM provides a flexible clustering foundation with its fuzzy logic, while the Encoder-Decoder CNN contributes to extracting complex features and refining the model. Genetic Algorithms finely adjust the parameters of the hybrid model. The optimal classification strategy complements this approach by ensuring precise data categorization. This hybrid strategy leverages the specific strengths of each component, thereby overcoming inherent limitations in each technique. FCM ensures robust cluster formation the Encoder-Decoder CNN improves feature representation, Genetic Algorithms optimize the hyper-parameters of the hybrid model, and optimal classification reinforces the accuracy of data categorization. Experiments conducted on various Big Data sets reveal a significant enhancement in clustering and classification accuracy, as well as overall analysis efficiency. This research represents a substantial contribution to the evolution of Big Data analysis by proposing an integrated solution harnessing the power of FCM, Encoder-Decoder CNN, Genetic Algorithms, and optimal classification The results suggest that this hybrid approach not only increases clustering and classification accuracy but also provides a versatile and adaptable solution to address challenges in large-scale data analysis.
Published
2024-12-15
How to Cite
Belhabib, F., BENSLIMANE, M., & El Moutaouakil, K. (2024). Advanced Big Data Analytics: Integrating Fuzzy C-Means, Encoder-Decoder CNNs, and Genetic Algorithms for Efficient Clustering and Classification. Statistics, Optimization & Information Computing, 13(1), 222-247. https://doi.org/10.19139/soic-2310-5070-1978
Issue
Section
Research Articles
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).