Examinando por Autor "Torre Bastida, Ana Isabel"
Mostrando 1 - 4 de 4
Resultados por página
Opciones de ordenación
Ítem ArtifactOps and ArtifactDL: a methodology and a language for conceptualizing and operationalising different types of pipelines(Springer Science and Business Media Deutschland GmbH, 2025-08-07) Miñón Jiménez, Raúl; Díaz de Arcaya Serrano, Josu; Torre Bastida, Ana Isabel; López de Armentia Mendizabal, Juan; Zárate Martínez, Gorka; Bonilla, Lander; Garcia Perez, Asier; Aguirre Usandizaga, JonMachine learning is already integrated in diverse domains enhancing their performance and decision support. For laboratories, this approach is normally sufficient. However, in real environments, these models can not be generally deployed isolated since they require additional steps to satisfy an objective. These steps can range from different data transformations to the inclusion of extra machine learning models which compose an analytic pipeline. Moreover, the majority of software solutions wrap a model into an API and, rarely, focus on the whole pipeline. These are unresolved topics in the well-known MLOps methodology, specifically in packaging and service phases. In addition, these concerns can also be extrapolated to other paradigms like DevOps or DataOps. In the context of the Pliades European project, this paper approaches the conceptualization of diverse types of pipelines from different perspectives and for different contexts, instead of simplifying the deployment and serving to an API. Thus, ArtifactOps methodology is proposed aimed at unifying XXOps paradigms which share the majority of stages. Finally, ArtifactDL pipeline definition language is proposed to describe the key aspects identified when designing different pipelines types and to support the proposed ArtifactOps methodology. Moreover, the research presents two real scenarios to better illustrate both ArtifactOps methodology and ArtifactDL pipeline definition language and it is defined an expert evaluation conducted to validate the approach.Ítem IEM: a unified lifecycle orchestrator for multilingual IaC deployments(Association for Computing Machinery, Inc, 2023-04-15) Díaz de Arcaya Serrano, Josu; Osaba, Eneko ; Benguria, Gorka; Etxaniz Errazkin, Iñaki; López Lobo, Jesús ; Alonso, Juncal; Torre Bastida, Ana Isabel ; Almeida, AitorOver the last few years, DevOps methodologies have promoted a more streamlined operationalization of software components in production environments. Infrastructure as Code (IaC) technologies play a key role in the lifecycle management of applications, as they promote the delivery of the infrastructural elements alongside the application components. This way, IaC technologies aspire to minimize the problems associated with the environment by providing a repeatable and traceable process. However, there are a large variety of IaC frameworks, each of them focusing on a different phase of the operationalization lifecycle, hence the necessity to master numerous technologies. In this research, we present the IaC Execution Manager (IEM), a tool devoted to providing a unified framework for the operationalization of software components that encompasses the various stages and technologies involved in the application lifecycle. We analyze an industrial use case to improve the current approach and conclude the IEM is a suitable tool for solving the problem as it promotes automation, while reducing the learning curve associated with the required IaC technologies.Ítem Orfeon: an AIOps framework for the goal-driven operationalization of distributed analytical pipelines(Elsevier B.V., 2023-03) Díaz de Arcaya Serrano, Josu; Torre Bastida, Ana Isabel; Miñón Jiménez, Raúl; Almeida, AitorThe use of Artificial Intelligence solutions keeps raising in the business domain. However, this adoption has not brought the expected results to companies so far. There are several reasons that make Artificial Intelligence solutions particularly complicated to adopt by businesses, such as the knowledge gap between the data science and operations teams. In this paper, we tackle the operationalization of distributed analytical pipelines in heterogeneous production environments, which span across different computational layers. In particular, we present a system called Orfeon, which can leverage different objectives and yields an optimized deployment for these pipelines. In addition, we offer the mathematical formulation of the problem alongside the objectives in hand (i.e. resilience, performance, and cost). Next, we propose a scenario utilizing cloud and edge infrastructural devices, in which we demonstrate how the system can optimize these objectives, without incurring scalability issues in terms of time nor memory. Finally, we compare the usefulness of Orfeon with a variety of tools in the field of machine learning operationalization and conclude that it is able to outperform these tools under the analyzed criteria, making it an appropriate system for the operationalization of machine learning pipelinesÍtem PADL: a modeling and deployment language for advanced analytical services(MDPI AG, 2020-11-24) Díaz de Arcaya Serrano, Josu; Miñón Jiménez, Raúl; Torre Bastida, Ana Isabel; Ser Lorente, Javier del; Almeida, AitorIn the smart city context, Big Data analytics plays an important role in processing the data collected through IoT devices. The analysis of the information gathered by sensors favors the generation of specific services and systems that not only improve the quality of life of the citizens, but also optimize the city resources. However, the difficulties of implementing this entire process in real scenarios are manifold, including the huge amount and heterogeneity of the devices, their geographical distribution, and the complexity of the necessary IT infrastructures. For this reason, the main contribution of this paper is the PADL description language, which has been specifically tailored to assist in the definition and operationalization phases of the machine learning life cycle. It provides annotations that serve as an abstraction layer from the underlying infrastructure and technologies, hence facilitating the work of data scientists and engineers. Due to its proficiency in the operationalization of distributed pipelines over edge, fog, and cloud layers, it is particularly useful in the complex and heterogeneous environments of smart cities. For this purpose, PADL contains functionalities for the specification of monitoring, notifications, and actuation capabilities. In addition, we provide tools that facilitate its adoption in production environments. Finally, we showcase the usefulness of the language by showing the definition of PADL-compliant analytical pipelines over two uses cases in a smart city context (flood control and waste management), demonstrating that its adoption is simple and beneficial for the definition of information and process flows in such environments.