Publications of András Kornai

Most files are stored on the server both as PostScript and as PDF, but the PS is gradually phased out. PS files are gzipped: if you have problems downloading them you may need to set content type to PostScript, and content encoding to x-gzip. The PDF versions are not compressed.


Mathematical Linguistics Springer Verlag. In the series Advanced Information and Knowledge Processing November 2007, ISBN 978-1-84628-985-9 Hardbound, x+290 pages. Order here Book Webpage

Formal Phonology In the series Outstanding Dissertations in Linguistics, Garland Publishing, 1995, ISBN 0-815-317301, hardbound, xxx+204 pages. Order Here pdf

On Hungarian Morphology In the series Linguistica, Hungarian Academy of Sciences, 1994, ISBN 963-8461-73-X, paperbound, 176 pages Order here pdf

Books edited

Proceedings of the 13th Biennial Meeting on Mathematics in Language (MOL13) Sofia, Bulgaria, August 9 2013 (Jointly with __ Marco Kuhlmann) ISBN 978-1-937284-65-7 ACL Anthology MOL Website

Proceedings of the 12th Biennial Meeting on Mathematics in Language (MOL12) Nara, Japan, September 6-8, 2011 (Jointly with Makoto Kanazawa, __, Marcus Kracht, Hiroyuki Seki) Springer LNCS 6878 2011, ISBN 978-3-642-23210-7 Order here

Finite-State Methods and Natural Language Processing Revised selected papers of the 8th FSMNLP International Workshop, Pretoria, South Africa, July 21-24, 2009 (Jointly with Anssi Yli-Jyrä, __, Jacques Sakarovitch, Bruce W. Watson) Springer LNCS 6062 2010, ISBN 978-3-642-14683-1 Order here

Proceedings of the HLT-NAACL Workshop on the Analysis of Geographic References (Jointly with __, Beth Sundheim) Association for Computational Linguistics, 2003, ISBN 1-932432-04-3 (WS9), paperbound, vi+81 pages.

Oxford International Encyclopedia of Linguistics (Editor in Chief: William Frawley) Area editor, mathematical linguistics. 4 vols, Oxford University Press, 2003, ISBN 978 0 19513 977 8 Order here Book webpage

Extended Finite State Models of Language In the series Studies in Natural Language Processing, Cambridge University Press, 1999, ISBN 0-521-63198-X, hardbound, x+278 pages. Order here Book Webpage


Lexical Semantics and Model Theory: Together at Last? (jointly with __, Marcus Kracht) In M. Kuhlmann, M. Kanazawa and G. Kobele (eds) Proc MOL 2015 51--61 pdf

Competence in lexical semantics (jointly with __, Judit Acs, Marton Makrai, David Nemeskey, Katalin Pajkossy, Gabor Recski) StarSem 2015, The 4th joint conference on lexical and computational semantics pdf

Finite automata with continuous input In S. Bensch, R. Freund, and F. Otto (eds) Short Papers from the Sixth Workshop on Non-Classical Models of Automata and Applications, Kassel 2014 pdf

Resolving the infinitude controversy 2014 Journal of Logic, Language, and Information 23/4 pp 481--492 DOI 10.1007/s10849-014-9203-2 pdf

Indian Subcontinent Language Vitalization (jointly with __, Pushpak Bhattacharyya) In G. N. Jha, K. Bali, S. L. Devi, E. Banerjee (eds) Proc. 2014 LREC Workshop on Indian Language Data: Resources and Evaluation (WILDRE2) 24-27 pdf

Bounding the impact of AGI Journal of Experimental and Theoretical Artificial Intelligence 2014 26/3 417--438 online pdf

Euclidean Automata In Mark Waser (ed) Implementing Selves with Safe Motivational Systems and Self-Improvement. Proc. AAAI Spring Symposium, Technical Report SS-14-03, 2014, AAAI Press 25--30 pdf

Digital language death PLoS ONE 8(10): e77056. doi:10.1371/journal.pone.0077056 and pdf 2013

Applicative structure in vector space models (jointly with Marton Makrai, David Mark Nemeskey, __) In A. Allauzen, H. Larochelle, R. Socher, Ch. Manning (eds) Proc. 2013 ACL Workshop on Continuous Vector Space Models and their Compositionality 59-63 pdf

Building basic vocabulary across 40 languages (jointly with Judit Acs, Katalin Pajkossy, __) In S. Sharoff, R. Rapp, P. Zweigenbaum (eds) Proc. Sixth Workshop on Building and Using Comparable Corpora 2013 52-58 pdf

Structure Learning in Weighted Languages (jointly with __, Attila Zseder, Gabor Recski) In A. Kornai, M. Kuhlmann (eds) Proc. 13th Mathematics of Language Workshop 2013 72-82 pdf

A practical approach to language complexity: a wikipedia case study (jointly with T. Yasseri, __, J. Kertész) PLoS ONE 7(11): e48386. doi:10.1371/journal.pone.0048386 and pdf 2012

Rapid creation of large-scale corpora and frequency dictionaries (jointly with A. Zséder, G. Recski, D. Varga, __) In N. Calzolari et al (eds) Proc. LREC 2012 Istanbul pdf

Eliminating ditransitives Ph. de Groote, M-J Nederhof (eds) Revised and Selected Papers from the 15th and 16th Formal Grammar Conferences Springer LNCS 7395 2012 243-261 pdf

Dynamics of conflicts in Wikipedia (jointly with T. Yasseri, R. Sumi, A. Rung, __, J. Kertész) PLoS ONE 7(6): e38869. doi:10.1371/journal.pone.0038869 and pdf

Probabilistic grammars and languages 2011 Journal of Logic, Language, and Information 20 317-328 pdf

Edit wars in Wikipedia (jointly with R. Sumi, T. Yasseri, A. Rung, __, J. Kertész) Proc. 3rd Intl Conf on Social Computing Cambridge MA 2011, 724-727 pdf

Finite state methods and models in natural language processing (Jointly with A. Yli-Jyrä, ___, Jacques Sakarovitch) Natural Language Engineering 2011 17/2 141-144 pdf

The treatment of ordinary quantification in English proper Hungarian Review of Philosophy Special issue on Imre Ruzsa -- a man of consequence. 2010 54/4 150-162 pdf

The algebra of lexical semantics In C. Ebert, G. Jäger, J. Michaelis (eds) Proc. 11th Mathematics of Language workshop (MOL11) Springer LNCS 6149 2010 174-199 pdf

Rekurzivak-e a természetes nyelvek? (Are natural languages recursive?) Magyar Tudomány 2010/8 994-1005 pdf

NP alignment in bilingual corpora (jointly with G. Recski, A. Rung, A. Zseder, ___) In N. Calzolari (ed) Proc. 7th International Conference on Language Resources and Evaluation (LREC'10) pdf

The complexity of phonology Linguistic Inquiry 2009 40 701-712 pdf

Google for the linguist on a budget (jointly with __ P. Halacsy) 2008 In S. Evert, A. Kilgarriff and S. Sharoff (eds) Proc. 4th Web as Corpus Workshop (WAC-4) 8-11 pdf

On the proper definition of information 2008 In T. Bynum, M. Calzarossa, I. de Lotto and S. Rogerson (eds) Proc. 10th Ethicomp Conference 488-495 pdf

Parallel creation of gigaword corpora for medium density languages - an interim report (Jointly with P. Halacsy, __, P. Nemeth, D. Varga). 2008 In N. Calzolari et al (eds) Proc. 6th International Conference on Language Resources and Evaluation (LREC'08) European Language Resources Association (ELRA) 858 pdf

HunPos -- an open source trigram tagger (Jointly with P. Halacsy, __, Cs. Oravecz). 2007 In S. Ananiadou (ed) Proc. ACL2007 Demo and Poster Sessions 209-212 pdf

Parallel corpora for medium density languages (Jointly with D. Varga, P. Halacsy, __, V. Nagy, L. Nemeth, V. Tron). In N. Nicolov, K. Bontcheva, G. Angelova and R. Mitkov (eds): Recent Advances in Natural Language Processing IV. Selected papers from RANLP-05 John Benjamins, 2007, 247-258 pdf

Using a morphological analyzer in high precision POS tagging of Hungarian (Jointly with P. Halacsy, __, Cs. Oravecz, V. Tron, D. Varga). 2006 In N. Calzolari and K. Choukri (eds) Proc. LREC 2006 2245-2248 pdf

Web-based frequency dictionaries for medium density languages (Jointly with __, P. Halacsy, V. Nagy, Cs. Oravecz, V. Tron, D. Varga). In A. Kilgariff and M. Baroni (eds) Proc. 2nd Web as Corpus Wkshp (EACL 2006 WS01) 1-8 pdf

Evaluating geographic information retrieval In C. Peters, F. Gey, J. Gonzalo, H. Mueller, G. Jones, M. Kluck, B. Magnini, and M. de Rijke (eds): Accessing Multilingual Information Repositories. Revised Selected Papers of the Cross-Language Evaluation Forum (CLEF 2005) Springer LNCS 4022, 928-938 pdf

Hunmorph: open source word analysis (Jointly with V. Tron, Gy. Gyepesi, P. Halacsy, __, L. Nemeth, D. Varga). In M. Jansche (ed): Proc. ACL 2005 Software Workshop 77-85 pdf

Creating open language resources for Hungarian (Jointly with P. Halacsy, __, L. Nemeth, A. Rung, I. Szakadat, V. Tron). In Proc. LREC 2004 203-210 ps pdf

Leveraging the open source ispell codebase for minority language analysis (Jointly with L. Nemeth, V. Tron, P. Halacsy, __, A. Rung, I. Szakadat). In J. Carson-Berndsen (ed): Proc. SALTMIL 2004 56-59 ps pdf

Automatic translation to controlled medical vocabularies (Jointly with __, L. Stone). In A. Abraham and L. Jain (eds): Innovations in Intelligent Systems and Applications Springer Verlag, 2004, 413-434 ps pdf

Classifying the Hungarian web (Jointly with __, M. Krellenstein, M. Mulligan, D. Twomey, F. Veress, A. Wysoker) In A. Copestake and J. Hajic (eds): Proc. EACL 2003 203-210 ps pdf

Explicit finitism International Journal of Theoretical Physics 2003/2 301-307 ps pdf background material

Mathematical Linguistics (Jointly with G.K. Pullum, __) In W. Frawley (ed): Oxford International Encyclopedia of Linguistics, Oxford University Press 2003, v3 17-20 ps pdf

Optical Character Recognition In W. Frawley (ed): Oxford International Encyclopedia of Linguistics, Oxford University Press 2003, v3 33-34 ps pdf

How many words are there? Glottometrics 2002/4 61-86 ps pdf

Linear Discriminant Text Classification in High Dimension. (Jointly with __, J.M. Richards) In A. Abraham and M. Koeppen (eds): Hybrid Information Systems Physica Verlag, Heidelberg, 2002 527-538 ps pdf

Recent Improvements in the BBN OCR System (Jointly with R. Schwartz, Zh. Lu, P. Natarajan, I. Bazzi, __, John Makhoul) In D. Doermann (ed): Proc 1999 Symposium on Document Image Understanding Technology 1999, 245-251 pdf

OCR of degraded documents using HMM-based techniques (Jointly with I. Bazzi, P. Natarajan, R. Schwartz, __, Zh. Lu, John Makhoul) In D. Doermann (ed): Proc 1999 Symposium on Document Image Understanding Technology 1999, 149-153 pdf

Zipf's law outside the middle range Proc. Sixth Meeting on Mathematics of Language University of Central Florida, 1999 347-356 ps pdf

A Robust, Language-Independent OCR System. (Jointly with Z. Lu, I. Bazzi, __, J. Makhoul, P. Natarajan, R. Schwartz) In: Robert J. Mericsko (ed): Proc. 27th AIPR Workshop: Advances in Computer-Assisted Recognition SPIE Proceedings 3584 1999 ps pdf

Quantitative Comparison of Languages. Grammars 1998/2 155-165 ps pdf

An Experimental HMM-Based Postal OCR System. In: Proceedings of ICASSP'97, IEEE Computer Society Press, Los Alamitos CA, IV, 3177-3180 ps pdf

Gépi ékezés (Computer generation of accent marks. Jointly with __, G. Tóth) In Magyar Tudomány 1997/4 400-410 ps pdf English summary and source code

Analytic models in phonology. In: J. Durand, B. Laks (eds): Current Trends in Phonology: Models and Methods CNRS, ESRI, Paris X 1996 395-418 ps pdf

Extended finite state models of language. In: Natural Language Engineering 1996/4 287-290 ps pdf

Comments on Mohri, Pereira, and Riley In: A. Kornai (ed): Proceedings of the W1 workshop of the 12th European Conference on Artificial Intelligence, Budapest 1996 73-74 ps pdf

Vectorized finite state automata. In: A. Kornai (ed): Proceedings of the W1 workshop of the 12th European Conference on Artificial Intelligence, Budapest 1996 36-41 ps pdf

Statistical zone finding. (Jointly with __, S.D. Connell) In: Proceedings of the 13th International Conference on Pattern Recognition, Vienna 1996, IEEE Computer Society Press, Los Alamitos CA, Vol III, 818-822 ps pdf

Recognition of cursive writing on personal checks. (jointly with __, K.M. Mohiuddin, S.D. Connell). In: Proceedings of the 5th International Workshop on Frontiers in Handwriting Recognition, Essex 1996 373-378 ps pdf

An HMM-based legal amount field OCR system for checks. (Jointly with __, K.M. Mohiuddin, S.D. Connell) In: Proc. Systems, Man, and Cybernetics, Vancouver, BC 1995 2800-2805 ps pdf

Relating phonetic and phonological categories In: E.S. Ristad (ed): Language Computations 1994 Providence, RI: American Mathematical Society, 21-36 ps pdf

Language models: where are the bottlenecks? AISB Quarterly 88 1994 36-40 ps pdf

The generative power of feature geometry. Annals of Mathematics and Artificial Intelligence 8 1993 37-46 ps pdf

Cataloging place names. Budapest Review of Books 3 1993 69-72 ps pdf In Hungarian: A nyelvészetrôl egyes szám elsô személyben Budapesti Könyvszemle 3 1993 110-113 ps pdf

Frequency in morphology. In: I. Kenesei (ed): Approaches to Hungarian IV 1992 246-268 ps pdf

Narrowness, pathwidth, and their application in natural language processing. (Jointly with __, Zs. Tuza) Discrete Applied Mathematics 36 1992 87-92 ps pdf

Hungarian Vowel Harmony. In: I. Kenesei (ed): Approaches to Hungarian III 1991 183-240 ps pdf

Nemzeti nyelv, nemzetközi tudomány. (National Language, International Science. Jointly with __, L. Kálmán) Nyelvtudományi Közlemények 92 1991 147-156 ps pdf

The X-bar Theory of Phrase Structure. (Jointly with __, G.K. Pullum) Language 66 1990 24-50 ps pdf

The Sonority Hierarchy in Hungarian. Nyelvtudományi Közlemények 91 1990 139-146 ps pdf

A fônévi csoport egyeztetése (Agreement in Noun Phrases). Általános nyelvészeti tanulmányok XVII 1989 183-211 ps pdf

Hungarian Sentence Intonation. (Jointly with __, L. Kálmán) In: H. van der Hulst and N. Smith (eds): Autosegmental studies on pitch accent. Foris, Dordrecht 1988 183-195 ps pdf

Compositionality, of, word-formation. Acta Linguistica 38 1988 118-130 ps pdf

Hungarian Vowel Harmony. In: M. Crowhurst (ed): Proceedings of the 6th West Coast Conference on Formal Linguistics. Stanford Linguistics Association 1987 147-161 image pdf

Logikai típusok és nyelvi típusok (Logical types and linguistic types ps pdf). In: I. Ruzsa (ed): Tertium Non Datur. Eötvös Loránd University, Budapest 1987 ps pdf

Finite state semantics. In U. Klenk, P. Scherber, and M. Thaller (eds): Computerlinguistik und philologische Datenverarbeitung Georg Olms Verlag, Hildesheim 1987 59-70 pdf

X-bar Grammars. In: J. Demetrovics, G.O.H. Katona, and A. Salomaa (eds): Algebra, Combinatorics, and Logic in Computer Science. North Holland 1986 523-536 images

Szótári adatbázis az akadémiai nagyszámitógépen (A dictionary database of Hungarian) Hungarian Academy of Sciences Institute of Linguistics Working Papers II 1986 65-79 ps pdf

The Internal Structure of Noun Phrases. In: I. Kenesei (ed): Approaches to Hungarian I 1985 79-92 ps pdf

Natural Languages and the Chomsky Hierarchy. In: M. King (ed): Proceedings of the 2nd European Conference of the Association for Computational Linguistics 1985 1-7 ps pdf

Lexical Categories and X-bar Features. Acta Linguistica 35 1985 117-131 ps pdf

Formális Stúdiumok (Formal Studies) Mimeographed lecture notes, Janus Pannonius University, Pécs 1984

A Finite Automaton for the English Auxiliary System. Hungarian Academy of Sciences Computer Science Institute Working Paper II/47 1982 images


US Patent 6,507,829: Textual data classification method and apparatus (Jointly with J.M. Richards, __) issued Jan 14 2003. html pdf

US Patent Application US20090119255 Methods of Systems Using Geographic Meta-Metadata in Information Retrieval and Document Displays (Jointly with J. Frank, __) published May 7 2009 pdf

Back to home page