Abstract

Video–text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call