Elyra’s Jupyter AI Pipelines Now Support Custom Components – InApps 2025

Main Contents:

Elyra’s Jupyter AI Pipelines Now Support Custom Components – InApps is an article under the topic Software Development Many of you are most interested in today !! Today, let’s InApps.net learn Elyra’s Jupyter AI Pipelines Now Support Custom Components – InApps in today’s post !

Key Summary

Overview: The 2022 article by InApps Technology discusses updates to Elyra, an open-source extension for JupyterLab, which now supports custom components in its visual AI pipeline editor, enhancing flexibility for data scientists and developers.
Key Points:
- Elyra Overview: Elyra extends JupyterLab with tools for building, managing, and deploying AI and data science workflows, focusing on visual pipeline creation for machine learning (ML) and data processing.
- Custom Components Support:
  - Allows users to define and integrate custom nodes (e.g., data preprocessing, model training) into Elyra’s pipeline editor.
  - Enables tailored workflows by incorporating proprietary or specialized scripts, beyond pre-built components.
  - Supports Python scripts, Jupyter Notebooks, and external tools as custom nodes, executable within pipelines.
- How It Works:
  - Users create custom components via YAML configuration files, specifying inputs, outputs, and execution logic.
  - Components are added to the visual pipeline editor, where they can be connected to form end-to-end workflows.
  - Pipelines can be executed on platforms like Kubernetes, Apache Airflow, or Kubeflow, ensuring scalability.
- Integration:
  - Seamlessly works with JupyterLab’s notebook interface for iterative development.
  - Supports cloud-native environments, integrating with tools like AWS, Azure, or on-premises Kubernetes clusters.
- Benefits:
  - Increases flexibility for complex, domain-specific AI workflows (e.g., healthcare, finance).
  - Reduces dependency on rigid, pre-defined components, enabling innovation.
  - Simplifies pipeline management for non-expert users through a visual interface.
  - Cost-effective development with offshore teams (e.g., Vietnam at $20-$40/hour via InApps Technology).
- Use Cases:
  - Building custom ML pipelines for tasks like image processing or natural language processing.
  - Automating data workflows in research or enterprise settings with proprietary algorithms.
  - Prototyping and deploying AI models in scalable, cloud-native environments.
- Challenges:
  - Requires familiarity with YAML and pipeline orchestration for custom component creation.
  - Integration with external platforms (e.g., Kubeflow) may need additional configuration.
Context: Elyra’s update aligns with the growing demand for flexible, user-friendly tools in MLOps, enhancing JupyterLab’s role in data science workflows.
Recommendations:
- Leverage Elyra for rapid prototyping of custom AI pipelines in JupyterLab.
- Use Kubernetes or Kubeflow for scalable pipeline execution in production.
- Partner with InApps Technology for expertise in Elyra implementation and offshore development to optimize costs.

Read more about Elyra’s Jupyter AI Pipelines Now Support Custom Components – InApps at Wikipedia

You can find content about Elyra’s Jupyter AI Pipelines Now Support Custom Components – InApps from the Wikipedia website

Elyra, the artificial intelligence (AI) toolkit first released by IBM in early 2020, helps data scientists with the often difficult process of building AI pipelines. As they wrote in the tool’s introductory post, “Building an AI pipeline for a model is hard. Breaking down and modularizing a pipeline is harder.” A data pipeline can include a number of steps, some relying on others, and creating this pipeline can lie outside the core skills needed for data science. Elyra solves this by offering a visual interface that turns creating and altering data pipelines into a familiar experience.

Patrick Titzler, a developer advocate at the Center for Open-Source Data and AI Technologies at IBM, explained that Elyra lets users assemble basic building blocks — Jupyter notebooks, Python scripts, and R scripts — into a pipeline that lets them perform tasks in sequence, in parallel, or otherwise.

“If you go through a machine learning workflow, you might have to load the data, analyze the data, cleanse it, then build the model, train the model, tune the model. And then you might have to go back when the results don’t really meet your expectations,” said Titzler. “With a pipeline editor, you can create those pipelines using simple drag and drop and then configure the nodes in that pipeline. So it speeds up your development because you don’t have to write any custom code to run all of those components or nodes in the pipeline. Plus, it enables people to actually do these things without necessarily having a deep domain expertise.”

This has been arguably the project’s most important feature, but the building blocks remained limited to those three types. With the recently released Elyra 3.3, however, users can create pipelines using custom components, a feature that Titzler wrote was “a major milestone on our roadmap.”

Previously, Elyra users could string together their own Jupyter notebooks or scripts, but they didn’t have access to external components, such as those available in Kubeflow Pipelines or Apache Airflow, the two platforms for running pipelines currently supported by Elyra. For example, the following image shows the components that are now available with Kubeflow Pipelines in Elyra, which includes things like creating a dataset volume or counting rows.

Another example of a set of components that can be added to Elyra with these changes is the Machine Learning Exchange, which provides an open source Data and AI assets catalog and execution engine for Kubeflow Pipelines. Titzler also points to the Component Library for AI, Machine Learning, ETL, and Data Science (CLAIMED) as an example. CLAIMED is a set of Jupyter notebooks that implement tasks such as data loading, data transformation, or model training, and can be used in Elyra as of this last release. CLAIMED can now be used by simply cloning the CLAIMED repository, and then the pipelines in that repository can then be opened in the pipeline editor for immediate use.

Titzler cautions that custom components differ from other components in Elyra in a few ways. First, they are runtime specific and often use runtime-specific mechanisms to exchange data with other components, instead of the S3-compatible storage used by generic components. They also need to be managed separately and are black boxes. While the Visual Pipeline Editor can expose their input and output, it does not have access to the functionality itself, necessarily.

Currently, Elyra only has support for local execution, Kubeflow Pipelines, and Apache Airflow, in terms of pipeline orchestration, but Titzler says that the community at large is in the process of adding others. He says that he has heard of interest in both Ray and Argo, but that any movement in those directions currently depends on the efforts of the community.

Looking forward, Titzler says that the project has “a big list of wishes that have come from various sources” but that improvements to the visual editor and increasing usability were among the current aspirations.

Rate this post

Anh Hoang

Anh Hoang is Head of SEO Optimization at InApps Technology, ensuring that the message and research of InApps Technology reach the most people possible while adhering to our strict journalistic standards of excellence and integrity.