graph TD A[Raw WAV Input] --> B[Tempo Detection] A --> C[Stem Separation] B --> D[Analyze BPM] B --> E[Detect Beat Positions] D --> F[Write Tempo Metadata] E --> F C --> G[Extract Vocals] C --> H[Extract Instrumentals] G --> I[Convert Vocal to WAV] H --> J[Convert Instrumental to WAV] F --> K[Metadata JSON] I --> L[Vocal WAV] J --> M[Instrumental WAV] K --> N[Final Output Package] L --> N M --> N style A fill:#f9f,stroke:#333 style N fill:#9ff,stroke:#333