IIIT-Delhi Institutional Repository

Browsing by Author "Abrol, Vinayak (Advisor)"

Browsing by Author "Abrol, Vinayak (Advisor)"

Sort by: Order: Results:

  • Thakran, Yash; Abrol, Vinayak (Advisor) (IIIT-Delhi, 2023-05-09)
    Abusive content detection in the spoken text can be addressed by performing Automatic Speech Recognition (ASR) and leveraging advancements in natural language processing. However, ASR models introduce latency and often ...
  • Singh, Barneet; Abrol, Vinayak (Advisor) (IIIT-Delhi, 2025-05)
    This thesis explores advanced techniques in the field of audio spoofing detection. With the emergence of high-quality deepfake generation techniques and the vulnerabilities in automatic speaker verification (ASV) systems, ...
  • Agrawal, Yash; Abrol, Vinayak (Advisor) (IIIT-Delhi, 2025-05)
    This thesis work focuses on advancements in neural style transfer, a process that enables the blending of content and style features to generate stylized images. It explores feature extraction using two encoders: a VGG19-based ...
  • Verma, Akash; Abrol, Vinayak (Advisor) (IIIT-Delhi, 2024-05-21)
    Speaker verification (SV) focuses on confirming or denying the claimed identity of a speaker. It is a one-to-one comparison between the test utterance and the claimant’s stored reference voice sample. This process is ...
  • Thakran, Yash; Abrol, Vinayak (Advisor) (IIIT-Delhi, 2023-12-08)
    Modeling directly raw waveforms through neural networks for speech processing is gaining more and more attention. Despite its varied success, a question that remains is: what kind of information are such neural networks ...
  • Deepika, N; Abrol, Vinayak (Advisor) (IIIT-Delhi, 2025-05)
    This thesis investigates two sequential studies toward real-time music synthesis directly from images via learned cross-modal embedding mappings and presents a unified deep-learning framework. In Study I, we explored a ...
  • Patil, Akshet; Abrol, Vinayak (Advisor) (IIIT-Delhi, 2025-05)
    This thesis introduces a novel framework for language translation, transitioning from conventional text-based mapping to a phoneme-level modeling approach. By employing articulatory phoneme representations and sparse binary ...
  • Chaudhary, Aryan; Abrol, Vinayak (Advisor) (IIIT-Delhi, 2024-05-01)
    This thesis explores the integration of quaternion algebra into neural network architectures to enhance their efficiency for diverse audio processing tasks. Quaternion-based transformations are employed to achieve structural ...

Search Repository

Browse

My Account