Tongjia Chen

Tongjia Chen 陈同嘉

Pronounced like 'TomJade', or you can just call me Tom :)

I'm an incoming CS Ph.D. student at the University of Melbourne, under the supervision of Prof. Eun-Jung Holden and Prof. Krista Ehinger.

I was fortunate to have been working closely with Prof. Chen Chen on multi-modality video understanding.

Email / Github / Google Scholar / LinkedIn

News

[2025-02] Our GroundMoRE was accepted at CVPR 2025.

[2024-02] Our paper OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition was accepted at CVPR 2024. See you in Seattle!

[2023-06] We won the 1st place of the AQTC Challenge at CVPR@23 LOng-form VidEo Understanding and Generation (LOVEU) Workshop.

[2023-02] Our paper AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers was accepted at CVPR 2023.

Preprints

PointDiffuse: A Dual-Conditional Diffusion Model for Enhanced Point Cloud Semantic Segmentation
Yong He, Hongshan Yu, Mingtao Feng, Tongjia Chen, Zechuan Li, Anwaar Ulhaq, Saeed Anwar, Ajmal Mian
Preprint, 2025
arXiv

Soft Masked Transformer for Point Cloud Processing with Skip Attention-Based Upsampling
Yong He, Hongshan Yu, Muhammad Ibrahim, Xiaoyan Liu, Tongjia Chen, Anwaar Ulhaq, Ajmal Mian
Preprint, 2024
arXiv

Publications

Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
Andong Deng, Tongjia Chen, Shoubin Yu, Taojiannan Yang, Lincoln Spencer, Yapeng Tian, Ajmal Mian, Mohit Bansal, Chen Chen
CVPR 2025
Project / arXiv

OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition
Tongjia Chen, Hongshan Yu, Zhengeng Yang, Zechuan Li, Wei Sun, Chen Chen
CVPR 2024
Project / Code / arXiv

	First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment Tongjia Chen, Hongshan Yu, Zhengeng Yang, Ming Li, Zechuan Li, Jingwen Wang, Wei Miao, Wei Sun, Chen Chen Tech Report, LOVEU Workshop, CVPR 2023 Code / Certificate
	AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers Zechuan Li, Hongshan Yu, Zhengeng Yang, Tongjia Chen, Naveed Akhtar CVPR 2023 Project / Code / Video

Awards

1st place of the AQTC Challenge at CVPR@23 LOng-form VidEo Understanding and Generation (LOVEU) Workshop. 2023

Academic Excellence Scholarship of HNU (Top 20%). 2021, 2022

Academic Excellence Scholarship of CUG (Top 10%). 2017-2020

Talks

Time: 2023.6.18

Title: First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment

Source: LOng-form VidEo Understanding and Generation (LOVEU) Workshop, CVPR 2023.

Video / Slides

Misc

CrossFitter / Culer / TIFOSI

Updated at Jul. 2024

Thanks Jon Barron for this amazing template.