Improving Text-to-Image Generation with Multimodal Semantic Coherence in Adversarial Training
Research in the field of text to image generation has shown incredible momentum owing to the availability of more powerful natural language processing (NLP) models and generative networks. The quality of data representation learned by the generative models acts as a determining factor in the success of these models. Self-supervised learning augments the generative power […]