Distilling step-by-step! outperforming larger language models with less training data and smaller model sizes CY Hsieh, CL Li, CK Yeh, H Nakhost, Y Fujii, A Ratner, R Krishna, CY Lee, ... arXiv preprint arXiv:2305.02301, 2023 | 460 | 2023 |
Towards unconstrained end-to-end text spotting S Qin, A Bissacco, M Raptis, Y Fujii, Y Xiao Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 149 | 2019 |
A scalable handwritten text recognition system RR Ingle, Y Fujii, T Deselaers, J Baccash, AC Popat 2019 International conference on document analysis and recognition (ICDAR …, 2019 | 135 | 2019 |
Towards end-to-end unified scene text detection and layout analysis S Long, S Qin, D Panteleev, A Bissacco, Y Fujii, M Raptis Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 104 | 2022 |
Chain-of-table: Evolving tables in the reasoning chain for table understanding Z Wang, H Zhang, CL Li, JM Eisenschlos, V Perot, Z Wang, L Miculicich, ... arXiv preprint arXiv:2401.04398, 2024 | 87 | 2024 |
Formnet: Structural encoding beyond sequential modeling in form document information extraction CY Lee, CL Li, T Dozat, V Perot, G Su, N Hua, J Ainslie, R Wang, Y Fujii, ... arXiv preprint arXiv:2203.08411, 2022 | 86 | 2022 |
Tool documentation enables zero-shot tool-usage with large language models CY Hsieh, SA Chen, CL Li, Y Fujii, A Ratner, CY Lee, R Krishna, T Pfister arXiv preprint arXiv:2308.00675, 2023 | 67 | 2023 |
Rethinking text line recognition models DH Diaz, S Qin, R Ingle, Y Fujii, A Bissacco arXiv preprint arXiv:2104.07787, 2021 | 57 | 2021 |
Sequence-to-label script identification for multilingual ocr Y Fujii, K Driesen, J Baccash, A Hurst, AC Popat 2017 14th IAPR international conference on document analysis and recognition …, 2017 | 49 | 2017 |
A web-based ocr service for documents J Walker, Y Fujii, AC Popat Proceedings of the 13th IAPR international workshop on document analysis …, 2018 | 42 | 2018 |
Publication date estimation for printed historical documents using convolutional neural networks Y Li, D Genzel, Y Fujii, AC Popat Proceedings of the 3rd international workshop on historical document imaging …, 2015 | 34 | 2015 |
A robust/fast spoken term detection method based on a syllable n-gram index with a distance metric S Nakagawa, K Iwami, Y Fujii, K Yamamoto Speech Communication 55 (3), 470-485, 2013 | 32 | 2013 |
Large vocabulary speech recognition system: SPOJUS++ Y Fujii, K Yamamoto, S Nakagawa Proc. International Conference MUSP, 110-118, 2011 | 31 | 2011 |
Rope: reading order equivariant positional encoding for graph-based document information extraction CY Lee, CL Li, C Wang, R Wang, Y Fujii, S Qin, A Popat, T Pfister arXiv preprint arXiv:2106.10786, 2021 | 29 | 2021 |
Class lecture summarization taking into account consecutiveness of important sentences. Y Fujii, K Yamamoto, N Kitaoka, S Nakagawa INTERSPEECH, 2438-2441, 2008 | 29 | 2008 |
Post-ocr paragraph recognition by graph convolutional networks R Wang, Y Fujii, AC Popat Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022 | 28 | 2022 |
Out-of-vocabulary term detection by n-gram array with distance from continuous syllable recognition results K Iwami, Y Fujii, K Yamamoto, S Nakagawa 2010 IEEE Spoken Language Technology Workshop, 212-217, 2010 | 26 | 2010 |
Automatic extraction of cue phrases for important sentences in lecture speech and automatic lecture speech summarization. Y Fujii, N Kitaoka, S Nakagawa INTERSPEECH, 2801-2804, 2007 | 21 | 2007 |
Automatic speech recognition using hidden conditional neural fields Y Fujii, K Yamamoto, S Nakagawa 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 18 | 2011 |
ICDAR 2023 competition on hierarchical text detection and recognition S Long, S Qin, D Panteleev, A Bissacco, Y Fujii, M Raptis International Conference on Document Analysis and Recognition, 483-497, 2023 | 16 | 2023 |