Skip to Main Content
Cloud Management and AIOps


This is an IBM Automation portal for Cloud Management and AIOps products. To view all of your ideas submitted to IBM, create and manage groups of Ideas, or create an idea explicitly set to be either visible by all (public) or visible only to you and IBM (private), use the IBM Unified Ideas Portal (https://ideas.ibm.com).

Shape the future of IBM!

We invite you to shape the future of IBM, including product roadmaps, by submitting ideas that matter to you the most. Here's how it works:

Search existing ideas

Start by searching and reviewing ideas and requests to enhance a product or service. Take a look at ideas others have posted, and add a comment, vote, or subscribe to updates on them if they matter to you. If you can't find what you are looking for,

Post your ideas
  1. Post an idea.

  2. Get feedback from the IBM team and other customers to refine your idea.

  3. Follow the idea through the IBM Ideas process.

Specific links you will want to bookmark for future use

Welcome to the IBM Ideas Portal (https://www.ibm.com/ideas) - Use this site to find out additional information and details about the IBM Ideas process and statuses.

IBM Unified Ideas Portal (https://ideas.ibm.com) - Use this site to view all of your ideas, create new ideas for any IBM product, or search for ideas across all of IBM.

ideasibm@us.ibm.com - Use this email to suggest enhancements to the Ideas process or request help from IBM for submitting your Ideas.

Status Future consideration
Workspace Instana
Categories Sensor
Created by Guest
Created on May 17, 2022

add cgroups v2 PSI metrics to Instana

With the cgroupv2 starting to emerge maybe is a good idea to start collecting and using the PSI metrics available in machines/containers that enabled cgroupsv2. They are valid for Servers and containers.

You can find details here about the PSI metrics:

https://www.kernel.org/doc/html/latest/accounting/psi.html#psi

https://adil.medium.com/how-to-monitor-server-via-psi-pressure-stall-information-and-cgroupv2-2d944a9e732e


Looking at Load Average (existing metric) doesn't give you the full perception of what is happening in your systems. Also Load average gives you only a 1min average value which in the container world is too long. Also, you need to relate Load average with other metrics to understand if you have problems.


With PSI (Pressure Stall Information), it identifies and quantifies the disruptions caused by such resource crunches and the time impact it has on complex workloads or even entire systems.

It creates 3 different files:

/proc/pressure/cpu ,

/proc/pressure/memory ,

/proc/pressure/io

Inside those files you have the pressure metrics as 10, 60, 300 sec average.

Example for CPU:

root:~# cat /proc/pressure/cpu
some avg10=0.03 avg60=0.07 avg300=0.06 total=5376072182

Avg10: How long have the processes stalled for the last 10 seconds
Avg60: How long have the processes stalled for the last 60 seconds
Avg300: How long have the processes stalled for the last 300 seconds
Total: How long have the processes stalled since the server booted

If a process was starved of the CPU for 5 seconds in the last 10 seconds, the Avg10 column will be 50, which means 50% of the last 10 seconds.

Idea priority High