首页> 外文会议>IEEE/CVF Conference on Computer Vision and Pattern Recognition >Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt
【24h】

Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt

机译:Sketch-BERT:通过草图格式塔的自学学习,从变压器学习草图双向编码器表示

获取原文

摘要

Previous researches of sketches often considered sketches in pixel format and leveraged CNN based models in the sketch understanding. Fundamentally, a sketch is stored as a sequence of data points, a vector format representation, rather than the photo-realistic image of pixels. SketchRNN studied a generative neural representation for sketches of vector format by Long Short Term Memory networks (LSTM). Unfortunately, the representation learned by SketchRNN is primarily for the generation tasks, rather than the other tasks of recognition and retrieval of sketches. To this end and inspired by the recent BERT model, we present a model of learning Sketch Bidirectional Encoder Representation from Transformer (Sketch-BERT). We generalize BERT to sketch domain, with the novel proposed components and pre-training algorithms, including the newly designed sketch embedding networks, and the self-supervised learning of sketch gestalt. Particularly, towards the pre-training task, we present a novel Sketch Gestalt Model (SGM) to help train the Sketch-BERT. Experimentally, we show that the learned representation of Sketch-BERT can help and improve the performance of the downstream tasks of sketch recognition, sketch retrieval, and sketch gestalt.
机译:草图的先前研究通常将草图视为像素格式,并在草图理解中利用了基于CNN的模型。从根本上说,草图存储为数据点序列,矢量格式表示形式,而不是像素的逼真的图像。 SketchRNN通过长短期记忆网络(LSTM)研究了矢量格式草图的生成神经表示。不幸的是,SketchRNN学习的表示形式主要用于生成任务,而不是用于草图的识别和检索的其他任务。为此,在最近的BERT模型的启发下,我们提出了一个从Transformer(Sketch-BERT)学习草图双向编码器表示的模型。我们将BERT推广到草图领域,并使用新颖的提议组件和预训练算法,包括新设计的草图嵌入网络以及草图格式手势的自监督学习。特别是针对预训练任务,我们提出了一种新颖的Sketch Gestalt模型(SGM),以帮助训练Sketch-BERT。通过实验,我们表明,Sketch-BERT的学习表示形式可以帮助并提高草图识别,草图检索和草图格式手势的下游任务的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号