Hardware Management/FMFM
Jump to navigation
Jump to search
Welcome to the OCP Fleetscale Memory Fault Management (FMFM) WIKI
Fleetscale Memory Fault Management is a Worksteam within the Hardware Management Project.
Leadership
Scope
The FMFM is a workstream about standardization of Fleetscale Memory Fault Management
- Proposed topics:
- Standardize vendor agnostic architecture for memory error handling
- Modularization of inputs from different hardware vendors
- APIs and connections between different modules from different vendors.
- Define the output of each module (failure cause, health information, RAS actions, etc.)
- Standardize memory error telemetry
- Format content for better fleet scale RAS management
- Troubleshooting, FRU replacement policies, etc.
- Coordinate with the broader OCP group to make sure there is a path for this general architecture
Get Involved
Subproject Meets Biweekly on Tuesday from 7:00-8:00 am PST
- Link to the FMFM Calendar
- Link to the Meeting
- You can also dial in using your phone : United States: +1 (646) 749-3112 Access Code: 454-746-381
Mailing List
Participate in the discussion:
- FMFM on OCP Groups.io: FMFM Group Link
- Subscribe to mailing list
- Post to mailing list
Documents
- Link to Fleetscale Memory Fault Management (FMFM) Workstream Proposal
- Link to Fleetscale Memory Fault Management (FMFM) Framework Requirements
Past Presentation Recordings
FMFM Weekly Call Recordings
- Nov 19, 2024
- Nov 05, 2024
- Oct 22, 2024
- Oct 08, 2024
- Sep 24, 2024
- Sep 10, 2024
- Aug 27, 2024
- Aug 13, 2024
- Jul 30, 2024
- Jul 16, 2024
- Jun 18, 2024
- Jun 04, 2024
- May 21, 2024
- May 07, 2024
- Apr 23, 2024
- Apr 09, 2024
- Mar 26, 2024
- Mar 12, 2024
- Feb 27, 2024
- Feb 13, 2024
- Jan 30, 2024
- Jan 16, 2024
- Jan 2, 2024
- Dec 5, 2023
- Nov 21, 2023
- Nov 7, 2023
- Oct 24, 2023
- Oct 10, 2023
- Sep 26, 2023
- Sep 12, 2023
- Aug 29, 2023