Troubleshooting DOCA Platform Framework
This section provides comprehensive troubleshooting guidance for common issues you may encounter while deploying, configuring, or operating the DOCA Platform Framework (DPF).
Quick Diagnostic Tools
🔍 DPF CLI (dpfctl)
Command-line tool for visualizing, debugging, and troubleshooting DPU resources in Kubernetes. Essential for real-time visibility into resource states and conditions.
Use when:
- DPU provisioning is failing
- You need to understand resource dependencies
- You are debugging component readiness issues
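When debugging readiness by hand, it can help to filter a resource's conditions down to the ones that are not yet satisfied. The following is a minimal Python sketch, not part of dpfctl. It assumes the resource has been exported as JSON (for example with `kubectl get <resource> -o json`) and that it follows the standard Kubernetes `status.conditions` convention. The sample object, its condition types, and the `reason` and `message` values are hypothetical and for illustration only.

```python
import json


def unready_conditions(resource: dict) -> list[dict]:
    """Return the status.conditions entries whose status is not "True".

    Assumes positive-polarity conditions (status "True" means healthy),
    which is the common Kubernetes convention but not guaranteed for
    every condition type.
    """
    conditions = resource.get("status", {}).get("conditions", [])
    return [c for c in conditions if c.get("status") != "True"]


# Hypothetical sample; real DPF resources may use different kinds,
# condition types, reasons, and messages.
sample = json.loads("""
{
  "kind": "ExampleResource",
  "metadata": {"name": "example"},
  "status": {
    "conditions": [
      {"type": "Initialized", "status": "True", "reason": "Done"},
      {"type": "Ready", "status": "False", "reason": "Pending",
       "message": "waiting for a dependency"}
    ]
  }
}
""")

for cond in unready_conditions(sample):
    # Print one line per condition that still needs attention.
    print(f"{cond['type']}: {cond.get('reason', '')}: {cond.get('message', '')}")
```

A resource with no `status` block yields an empty list, so the helper can be run safely against objects that have not been reconciled yet.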
📊 System Reports (sosreport)
Generate comprehensive system reports for deeper analysis and support requests.
Use when:
- You need detailed system information for support cases
- You are investigating complex infrastructure issues
- You are preparing diagnostic data for NVIDIA support
Escalation Path
If you cannot resolve the issue using the guides above:
- Collect Diagnostic Information
  * Generate a sosreport for your environment
- Check Known Issues
  * Review Release Notes for known issues
  * Search the GitHub repository for similar problems
- Contact Support
  * Open an issue on the GitHub repository
  * Include diagnostic information and steps to reproduce
  * For enterprise customers, contact NVIDIA support with your diagnostic package
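If the diagnostic information consists of several files (a sosreport archive, saved resource dumps, logs), packing them into a single archive makes them easier to attach to an issue or support case. The following is a minimal Python sketch using only the standard library. The function name and any file names passed to it are hypothetical, and it does not generate the sosreport itself.

```python
import tarfile
from pathlib import Path


def bundle_diagnostics(paths, output):
    """Pack existing files or directories into a gzip-compressed tarball.

    Paths that do not exist are skipped. Returns the names that were
    actually added, so the caller can see what the bundle contains.
    """
    included = []
    with tarfile.open(output, "w:gz") as tar:
        for path in map(Path, paths):
            if path.exists():
                # Store entries under their base name to avoid leaking
                # absolute local paths into the archive.
                tar.add(path, arcname=path.name)
                included.append(path.name)
    return included


# Example call (file names are placeholders):
# bundle_diagnostics(["resource-dump.json", "notes.txt"], "diagnostics.tar.gz")
```

Review the archive contents before sharing it publicly, since diagnostic data can include hostnames, addresses, or other environment details.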
Additional Resources
- User Guides: Operational procedures and best practices
- Architecture: Understanding system design for better troubleshooting
- API Reference: Complete API documentation for debugging configurations