graph TD A[Text Script Input] --> B[Initial Voice Generation] B -->|#170 Turn Script to MP3| C[Initial MP3 URL] C --> D[Convert to WAV] D -->|#178 MP3 to WAV| E[WAV File] E --> F[Generate Waveform] F -->|#179 Create Waveform| G[Waveform JPEG] G --> H[Quality Analysis] H -->|#176 GPT Vision Analysis| I{Quality Check} I -->|Pass| J[Final MP3 URL] I -->|Fail| K[Audio Optimization] K -->|#219 Cut WAV| L[WAV Segments] L -->|#178 Convert & Normalize| M[Normalized MP3] M --> J