TL;DR: We propose ReAlign, a plug-and-play reward-guided alignment strategy for text-to-motion generation, which explicitly enhances both semantic consistency and motion realism throughout the ...
Abstract: Recently, remote sensing image captioning (RSIC) has gained significant attention in the remote sensing community. Due to the significant differences in spatial resolution of remote sensing ...
Prism is a ChatGPT-powered text editor that automates much of the work involved in writing scientific papers. OpenAI just revealed what its new in-house team, OpenAI for Science, has been up to. The ...
Abstract: The task of table-to-text generation involves summarizing and creating natural language descriptions of tables. Previous approaches have used sequence-to-sequence generation methods, which ...
VoiceCraft is a token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results