谷歌浏览器插件
订阅小程序
在清言上使用

diffvg+CLIP: Generating Painting Trajectories from Text

Gerry Chen, Alice Dumay, Mengyi Tang

semanticscholar(2021)

引用 2|浏览0
暂无评分
摘要
Given a sentence, the goal of our project is to find out the optimized trajectories for a painting robot to paint an image that illustrates the context of the sentence. We investigate two solution approaches. In the first approach, we consider two stages: in the first stage a set of candidate images are generated in a classification task to match the given sentence, and in the second stage a candidate raster image is simplified into a robot-paint-image along with a vector trajectory with a color labels. For the first stage, we implemented CLIP (Contrastive Language-Image Pre-training), a computer vision system released by OpenAI to generate candidate images. In the second stage, we proposed two different approaches: (i) using an off-the-shelf differentiable vector graphics rasterizing library named diffvg and (ii) applying a simple image segmentation task SLIC and backpropagating through a probability-based network to achieve a set of probability measure, which can be represented as the trajectory. In the second approach, we pair diffvg with CLIP and back-propogate through the entire chain to produce vector graphics which are described by a sentence. Additional details are provided in Section 2.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要