| dc.description.abstract |
This thesis focuses on advancements in neural style transfer, a process that blends content and style features to generate stylized images. It explores feature extraction with two encoders: a VGG19-based encoder and a GLOW-based encoder, the latter improving image reconstruction and reducing content leakage through its reversible transformations. Various feature fusion techniques are examined, including Adaptive Instance Normalization (AdaIN), Adaptive Attention Normalization (AdaAttN), Self-Attention Network (SANet), Multi-Channel Correlation Network (MCCNet), and Exact Feature Matching, which leverage statistical matching and attention mechanisms. The study also evaluates the impact of different loss functions, such as content loss, style loss, identity loss, and contrastive loss, on output quality. Custom transformation blocks are introduced, combining methods such as feature concatenation, AdaIN with alternative normalizations, and GLOW-based encoders augmented with attention modules. Existing architectures, such as AdaIN and Exact Feature Matching, are further refined by integrating additional losses to improve stylization fidelity and content preservation. |
en_US |
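The AdaIN fusion step mentioned in the abstract aligns the per-channel statistics of content features with those of style features. A minimal NumPy sketch of that statistical matching (function name and array shapes are illustrative assumptions, not taken from the thesis):

```python
import numpy as np

def adain(content, style, eps=1e-5):
    """Adaptive Instance Normalization (illustrative sketch).

    content, style: feature maps of shape (C, H, W).
    Normalizes each content channel to zero mean / unit std over
    the spatial dimensions, then rescales it with the corresponding
    style channel's mean and std.
    """
    c_mu = content.mean(axis=(1, 2), keepdims=True)
    c_std = content.std(axis=(1, 2), keepdims=True) + eps  # eps avoids division by zero
    s_mu = style.mean(axis=(1, 2), keepdims=True)
    s_std = style.std(axis=(1, 2), keepdims=True) + eps
    return s_std * (content - c_mu) / c_std + s_mu
```

After this transformation, each output channel carries the style features' mean and standard deviation while retaining the spatial structure of the content features, which is the core idea behind AdaIN-based stylization.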