[NS][Lab_Seminar_240826]Learning Graph Neural Networks for Image Style Transfer.pptx

thanhdowork 71 views 21 slides Aug 27, 2024

Slide 1 of 21

About This Presentation

Learning Graph Neural Networks for Image Style Transfer

Size: 3.22 MB

Language: en

Added: Aug 27, 2024

Slides: 21 pages

Slide Content

Learning Graph Neural Networks for Image Style Transfer Tien-Bach-Thanh Do Network Science Lab Dept. of Artificial Intelligence The Catholic University of Korea E-mail: os fa19730 @catholic.ac.kr 202 4/08/26 Yongcheng Jing et al. ECCV 2022

Introduction What is Image style transfer? Transferring the artistic style from a source image to a content image Applications: digital art, photo editing, visual effects Challenges: Parametric methods: tend to distort local style patterns Non-parametric methods: often introduce artifacts due to patch mismatching

Motivation Problems with existing methods: Parametric: good global style transfer but poor local detail preservation Non-parametric: good local detail but prone to mismatches and artifacts Solution: Semi-parametric approach using GNNs to balance global and local style transfer Fig. 1. Existing parametric [14, 1, 30] and non-parametric [6, 42] NST methods either barely transfer the global style appearance to the target [6], or produce distorted local style patterns [14, 1, 30] and undesired artifacts [42]. By contrast, the proposed GNN-based semi-parametric approach achieves superior stylization performance in the transfers of both global stroke arrangement and local fine-grained patterns.

Method Overview Semi-parametric approach: Combine the strengths of both parametric and non-parametric methods GNNs are used to establish fine-grained content-style correspondences Key techniques: Graph construction: content and style patches as nodes Message passing: Heterogeneous GNN for style-content interaction Deformable graph convolution: handles cross-scale style-content matching

Method Stylization Graph Construction Nodes: content and style patches Edges: Inter-Domain edges: between content and style nodes Intra-Domain edges: within content nodes to ensure style coherence k-NN search Employ distance metric of normalized cross-correlation for pair-wise k-NN Target k-NN style nodes for content patch content and style patches k largest elements from set pair-wise NCC

Method Deformable Graph Convolution Need for scale adaptation: different content regions require different style scales (stroke sizes) Solution: deformable scale predictor adjusts patch sizes for optimal style-content matching Style-to-Content Message Passing Apply attention coefficient for each style node during message passing Content-to-Content Message Passing Use Feat2Patch

Method Local Patch-based manipulation Patch extraction: content and style feature patches extracted using sliding windows GNNs: Used to model the stylization process as message passing between content and style nodes Adaptive feature aggregation based on attention mechanisms

Method Global Feature Refinement Problem: local patch-based manipulation may lack global style coherence Solution: A global feature refinement stage using Adaptive Instance Normalization Ensure the final image preserves both fine details and global style appearance

Method Feature Encoding Decoding process: convert the refined features back into image Components: convolutional and bilinear upsampling layers with ReLU activations Outcome : generate the final stylized image with preserved content and applied style

Method Loss Function and Training Strategy Content loss Style loss mapping BN statistics over the feature maps mean standard deviation

Method Loss Function and Training Strategy

Experiments Experimental Settings Stylization graph construction stage, k = 5 by default for NCC-based KNN search Sliding window = 1, kernel size = 5*5 Employ pre-trained VGG-19 Dataset Microsoft COCO WikiArt Deep Graph Library

Experiments Qualitative comparison Fig. 3. Qualitative results of our proposed GNN-based semi-parametric stylization algorithm and other parametric [30, 14, 1] and non-parametric [6, 42] methods.

Experiments Efficiency analysis Table 1. Average speed comparison in terms of seconds per image

Experiments Ablation Studies - Heterogeneous aggregation schemes Fig. 4. Comparative results of using various aggregation mechanisms for heterogeneous message passing, including graph attention network (GAT) [43], graph convolutional network (GCN) [24], graph isomorphism network (GIN) [48], dynamic graph convolution (EdgeConv) [46], and GraphSage [12]. The GAT mechanism generally yields superior stylization results, thanks to its attention-based aggregation scheme in Eq. 2.

Experiments Ablation Studies - Stylization w/ and w/o the deformable scheme Fig. 5. Results of the equal-size patch division method and the proposed deformable one with a learnable scale predictor. Our deformable scheme allows for cross-scale style content matching, thereby leading to spatially-adaptive multi-stroke stylization with an enhanced semantic saliency (e.g., the foreground regions of the horse and squirrel)

Experiments Ablation Studies - Graph w/ and w/o intra-domain edges Fig. 6. Results of removing the content-to-content intra-domain edges (w/o Intra) and those with the intra-domain ones (w/ Intra). The devised intra-domain connections incorporate the inter-relationship between the stylized patches at different locations, thereby maintaining the global stylization coherence (e.g., the eye regions in the figure)

Experiments Ablation Studies - Euclidean distance vs normalized cross-correlations Fig. 7. Results obtained using Euclidean distance and normalized cross-correlation (NCC) for similarity measurement during the construction of heterogeneous edges

Experiments Ablation Studies - Various patch sizes Fig. 8. Results obtained using various patch sizes for constructing content and style vertices in the local patch-based manipulation module. By using a larger patch size, the stylized results can maintain an overall larger stroke size

Experiments Diversified Stylization Control Fig. 9. Flexible control of diversified patch-based arbitrary style transfer during inference. The proposed GNN-based semi-parametric stylization scheme makes it possible to generate heterogeneous style patterns with only a single trained model.

Conclusion Key contributions: Introduce a semiparametric style transfer method using GNNs Achieve superior results in both global and local style transfer Future work: Extend the GNN-based approach to other vision tasks Optimize the k-NN search process for faster stylization

[NS][Lab_Seminar_240826]Learning Graph Neural Networks for Image Style Transfer.pptx

About This Presentation

Slide Content

Tags

Categories

Download

Quick Actions

Statistics

Related Slideshows

[NS][Lab_Seminar_240826]Learning Graph Neural Networks for Image Style Transfer.pptx

About This Presentation

Slide Content

Slide 1

Slide 2

Slide 3

Slide 4

Slide 5

Slide 6

Slide 7

Slide 8

Slide 9

Slide 10

Slide 11

Slide 12

Slide 13

Slide 14

Slide 15

Slide 16

Slide 17

Slide 18

Slide 19

Slide 20

Slide 21

Tags

Categories

Download

Quick Actions

Statistics

Related Slideshows

TLE-9-Prepare-Salad-and-Dressing.pptxkkk

LESSON 1 ABOUT MEDIA AND INFORMATION.pptx

GRADE-8-AQUACULTURE-WEEKQ1.pdfdfawgwyrsewru

Feelings PP Game FOR CHILDREN IN ELEMENTARY SCHOOL.pptx

Jeopardy_Figures_of_Speech_Template.pptx [Autosaved].pptx

Jeopardy_Figures_of_Speech.pptxvdsvdsvsdvsd