PARALLEL KORPUSGA MA’LUMOT YIG‘ISHNING AVTOMATLASHTIRILGAN USULLARI
Referat
Ushbu maqolada parallel matnlarni yig‘ish jarayonida sun’iy intellekt (AI) va tabiiy tilni qayta ishlash (NLP) texnologiyalaridan foydalanish ko‘rib chiqiladi. Veb-skraping, mashinali o‘rganish va metama’lumotlar bo‘yicha filtrlash usullari orqali parallel korpus yaratish va tarjima sifatini avtomatik aniqlash imkoniyatlari tahlil qilinadi. Ushbu yondashuvlar parallel korpus yaratishni avtomatlashtirish va tarjima tizimlarining sifatini yaxshilashga xizmat qiladi.
Mualliflar haqida
Adabiyotlar ro'yxati
Bergsma, S., & Dredze, M. (2009). No Mention of Parallel Corpora: Web-Based Corpus Construction for Multilingual Language Processing. In Proceedings of the ACL-IJCNLP 2009 Conference Short Papers.
Ranzato, M., Boureau, Y. L., & LeCun, Y. (2007). Learning Labeled and Unlabeled Structures for Text Classification. In Proceedings of the 24th International Conference on Machine Learning.
Sennrich, R., Haddow, B., & Birch, A. (2016). Neural Machine Translation of Rare Words with Subword Units. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics.
Koehn, P., & Knight, K. (2003). Empirical Methods for Scalable Parallel Text Processing. In Proceedings of the HLT-NAACL 2003 Workshop.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A., Kaiser, Ł., & Polosukhin, I. (2017). Attention Is All You Need. In Advances in Neural Information Processing Systems.
Ruder, S. (2017). Neural Transfer Learning for Natural Language Processing. In Proceedings of the 8th International Conference on Natural Language Processing.