Skip to Main Content
Cloud Management and AIOps
Hide about this portal


This is an IBM Automation portal for Cloud Management, Technology Cost Management, Network Automation and AIOps products. To view all of your ideas submitted to IBM, create and manage groups of Ideas, or create an idea explicitly set to be either visible by all (public) or visible only to you and IBM (private), use the IBM Unified Ideas Portal (https://ideas.ibm.com).

Shape the future of IBM!

We invite you to shape the future of IBM, including product roadmaps, by submitting ideas that matter to you the most. Here's how it works:

Search existing ideas

Start by searching and reviewing ideas and requests to enhance a product or service. Take a look at ideas others have posted, and add a comment, vote, or subscribe to updates on them if they matter to you. If you can't find what you are looking for,

Post your ideas
  1. Post an idea.

  2. Get feedback from the IBM team and other customers to refine your idea.

  3. Follow the idea through the IBM Ideas process.

Specific links you will want to bookmark for future use

Welcome to the IBM Ideas Portal (https://www.ibm.com/ideas) - Use this site to find out additional information and details about the IBM Ideas process and statuses.

IBM Unified Ideas Portal (https://ideas.ibm.com) - Use this site to view all of your ideas, create new ideas for any IBM product, or search for ideas across all of IBM.

ideasibm@us.ibm.com - Use this email to suggest enhancements to the Ideas process or request help from IBM for submitting your Ideas.

Native Instana agent collection of Nvidia DCGX telemetry and Nvidia appliances healthcheck

See this idea on ideas.ibm.com

Following the guide on https://www.ibm.com/docs/en/instana-observability/current?topic=applications-monitoring-gpu-public-preview we were able to bring some data into Instana using OpenTelemetry and prometheus format collection and processing of the exported telemetry from Nvidia DCGX. However, specific data is not being labeled or collected by the agent. Also this Opentelemetry customization on a Nvidia appliance architecture is not viable considering that multiple endpoints will need to provide telemetry on the same port to a single Agent installed on the Cluster Manager. Nvidia utilizes a BCM (Bright Cluster Management) + UFM (Unified Fabric Manager) architecture to manage host images that have access to the actual GPUs on the appliance. Installing Instana on the BCM does not give native visility of the Hosts with GPUs, therefore, customization and collection of telemetry using Opentelemetry for environments with hundreds of nodes active under multiple appliances targeting the same port to collect telemetry is not feasible.

We need native support and instrumentation from the Instana agent.

Idea priority High