Joint Text and Audio Multi-modal Speaker Diarization. Mutian Li. MS Thesis, Computer Science, Emory University, 2025.