Zhihang Xie (@fbkzhihangxie)'s Twitter Profile
Zhihang Xie

@fbkzhihangxie

ID: 1865029394244251648

Joined: 06-12-2024 13:44:19

3 Tweets

8 Followers

11 Following

Zhihang Xie (@fbkzhihangxie)

New research fuels the debate between cascaded and E2E speech translation! It tackles error propagation by combining multiple ASR candidates with HuBERT features, which preserve acoustic information lost after ASR. Check the paper by Min et al. (2025) at: arxiv.org/pdf/2502.00377.

Zhihang Xie (@fbkzhihangxie)

ReShape Attention bridges speech & text models without extra parameters. Achieves +8.5% BLEU in translation by leveraging acoustic cues, outperforming cascade/E2E methods. Efficient & scalable. Check the paper by Kano et al. (2025) at: ieeexplore.ieee.org/stamp/stamp.js….

MT Group at FBK (@fbk_mt)

📢 Come and join our group! We offer a fully funded 3-year PhD position: 📔Automatic translation with large multimodal models iecs.unitn.it/education/admi… 📍Full details for application: iecs.unitn.it/education/admi… 📅Deadline May 12, 2025 #NLProc Fondazione Bruno Kessler - FBK

Marco Gaido (@mgaido91)

🚀 New shared task at #WMT2025 (co-located with EMNLP 2025): Model Compression for Machine Translation! Can you shrink an LLM and keep translation quality high? 🔧 Submit by July 3 and push the limits of efficient NLP! 👉 www2.statmt.org/wmt25/model-co… #NLP #ML #LLM #ModelCompression

Zhihang Xie (@fbkzhihangxie)

🚀 Boost rare-phrase translation in speech! Uses **bilingual dictionaries** (e.g., "climate change"→"Klimawandel") to dynamically bias outputs. ✅ **+21%** recall in streaming ST ✅ **+85%** in multimodal LLMs 🔗: arxiv.org/abs/2506.09175

Zhihang Xie (@fbkzhihangxie)

🚀 Boost rare-phrase translation in speech! Uses **bilingual dictionaries** to dynamically bias outputs. ✅ **+21%** recall in streaming ST ✅ **+85%** in multimodal LLMs 🔗: arxiv.org/abs/2506.09175

Zhihang Xie (@fbkzhihangxie)

🚀 AdvST: Adversarial training aligns speech and text distributions without parallel data! Combines adversarial learning + hidden-state swapping to fix length mismatch & boost low-resource speech translation. ieeexplore.ieee.org/document/10888…