Volgen
Mrinmaya Sachan
Mrinmaya Sachan
Assistant Professor, ETH Zürich
Geverifieerd e-mailadres voor inf.ethz.ch - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
Using content and interactions for discovering communities in social networks
M Sachan, D Contractor, TA Faruquie, LV Subramaniam
Proceedings of the 21st international conference on World Wide Web, 331-340, 2012
2392012
Distilling reasoning capabilities into smaller language models
K Shridhar, A Stolfo, M Sachan
arXiv preprint arXiv:2212.00193, 2022
167*2022
Contextual parameter generation for universal neural machine translation
EA Platanios, M Sachan, G Neubig, T Mitchell
arXiv preprint arXiv:1808.08493, 2018
1532018
Membership inference attacks against language models via neighbourhood comparison
J Mattern, F Mireshghallah, Z Jin, B Schölkopf, M Sachan, ...
arXiv preprint arXiv:2305.18462, 2023
1172023
Solving Electrical Networks to incorporate Supervision in Random Walks
M Sachan, D Hovy, E Hovy
Proceedings of the 22nd International Conference on World Wide Web, 109-110, 2013
1122013
Self-training for jointly learning to ask and answer questions
M Sachan, E Xing
Proceedings of the 2018 Conference of the North American Chapter of the …, 2018
1092018
Easy questions first? a case study on curriculum learning for question answering
M Sachan, E Xing
Proceedings of the 54th Annual Meeting of the Association for Computational …, 2016
1092016
Identifying metaphorical word use with tree kernels
D Hovy, S Srivastava, SK Jauhar, M Sachan, K Goyal, L Huiying, ...
Proceedings of the First Workshop on Metaphor in NLP, 2013
1072013
Effective use of bidirectional language modeling for transfer learning in biomedical named entity recognition
DS Sachan, P Xie, M Sachan, EP Xing
Machine learning for healthcare conference, 383-402, 2018
932018
When to make exceptions: Exploring language models as accounts of human moral judgment
Z Jin, S Levine, F Gonzalez Adauto, O Kamal, M Sap, M Sachan, ...
Advances in neural information processing systems 35, 28458-28473, 2022
882022
Can large language models infer causation from correlation?
Z Jin, J Liu, Z Lyu, S Poff, M Sachan, R Mihalcea, M Diab, B Schölkopf
arXiv preprint arXiv:2306.05836, 2023
852023
Learning answer-entailing structures for machine comprehension
M Sachan, K Dubey, E Xing, M Richardson
Proceedings of the 53rd Annual Meeting of the Association for Computational …, 2015
852015
Cladder: Assessing causal reasoning in language models
Z Jin, Y Chen, F Leeb, L Gresele, O Kamal, LYU Zhiheng, K Blin, ...
Thirty-seventh conference on neural information processing systems, 2023
73*2023
Logical fallacy detection
Z Jin, A Lalwani, T Vaidhya, X Shen, Y Ding, Z Lyu, M Sachan, ...
arXiv preprint arXiv:2202.13758, 2022
702022
Text-based rl agents with commonsense knowledge: New challenges, environments and baselines
K Murugesan, M Atzeni, P Kapanipathi, P Shukla, S Kumaravel, ...
Proceedings of the AAAI Conference on Artificial Intelligence 35 (10), 9018-9027, 2021
642021
Controlled text generation with natural language instructions
W Zhou, YE Jiang, E Wilcox, R Cotterell, M Sachan
International Conference on Machine Learning, 42602-42613, 2023
632023
A mechanistic interpretation of arithmetic reasoning in language models using causal mediation analysis
A Stolfo, Y Belinkov, M Sachan
arXiv preprint arXiv:2305.15054, 2023
612023
Agents: An open-source framework for autonomous language agents
W Zhou, YE Jiang, L Li, J Wu, T Wang, S Qiu, J Zhang, J Chen, R Wu, ...
arXiv preprint arXiv:2309.07870, 2023
562023
Autoregressive structured prediction with language models
T Liu, Y Jiang, N Monath, R Cotterell, M Sachan
arXiv preprint arXiv:2210.14698, 2022
562022
Recurrentgpt: Interactive generation of (arbitrarily) long text
W Zhou, YE Jiang, P Cui, T Wang, Z Xiao, Y Hou, R Cotterell, M Sachan
arXiv preprint arXiv:2305.13304, 2023
512023
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20