    Seq2Seq Scene Graph Generation Utilizing Vision-language Pretrained Model

    View/Open
    ZHANG-THESIS-2022.pdf (2.147Mb) (embargoed until: 2023-12-01)
    Date
    2022-12-20
    Author
    Zhang, Chenyu
    Abstract
    Scene graph generation (SGG) aims to represent a visual scene with a hierarchical structure that contains objects, attributes, and relationships. Most existing SGG methods require two stages, one for detecting objects and one for predicting their pairwise relationships, which makes them complicated and computationally intensive. Motivated by recent progress in vision-and-language pretraining (VLP), we propose to formulate SGG as a sequence generation problem that is compatible with the unified VLP pipeline. In this work, we present a one-stage sequence-to-sequence (Seq2Seq) SGG model with a Transformer backbone that can be trained alongside other vision-and-language tasks. Our approach achieves good performance in predicting both object labels and relationships. This study demonstrates the feasibility of formulating SGG as a Seq2Seq task and the potential of improving SGG with vision-and-language pretrained models.
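    As a rough illustration of what formulating SGG as sequence generation can look like, the sketch below flattens a scene graph, given as (subject, predicate, object) triples, into a target token sequence and parses such a sequence back into triples. This is not the thesis's actual serialization: the separator tokens `<sep>` and `<eos>`, the function names `linearize` and `parse`, and the one-token-per-label simplification are all assumptions made for the example.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# Hypothetical special tokens; the thesis's real vocabulary is not stated here.
SEP_TRIPLE = "<sep>"
END = "<eos>"


def linearize(triples: List[Triple]) -> List[str]:
    """Flatten relationship triples into one target sequence for a decoder."""
    tokens: List[str] = []
    for i, (subj, pred, obj) in enumerate(triples):
        if i > 0:
            tokens.append(SEP_TRIPLE)  # marks the boundary between triples
        tokens.extend([subj, pred, obj])
    tokens.append(END)
    return tokens


def parse(tokens: List[str]) -> List[Triple]:
    """Recover triples from a generated sequence, dropping malformed chunks."""
    triples: List[Triple] = []
    current: List[str] = []
    for tok in tokens:
        if tok in (SEP_TRIPLE, END):
            if len(current) == 3:  # keep only complete triples
                triples.append((current[0], current[1], current[2]))
            current = []
            if tok == END:
                break
        else:
            current.append(tok)
    return triples


if __name__ == "__main__":
    graph = [("man", "riding", "horse"), ("horse", "on", "grass")]
    sequence = linearize(graph)
    print(sequence)
    print(parse(sequence))
```

    Under this kind of formulation, a single decoder emits objects and relationships in one pass, so no separate detection stage is needed at the output level. The parser tolerates incomplete triples because a generative model is not guaranteed to emit well-formed output.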
    URI
    http://jhir.library.jhu.edu/handle/1774.2/68055
    Collections
    • ETD -- Graduate theses

    DSpace software copyright © 2002-2016  DuraSpace
    Policies | Contact Us | Send Feedback
    Theme by 
    Atmire NV
     

     
