It’s the Global Language, right? So how can it be language dependent? You propose a theory based on English. It has to apply to all languages. You propose a Natural Language Processing (NLP) or Computational Linguistics (CL) technique for a particular problem. For English. It applies to all languages. You build a software for some purpose. For English. It has to be useful for all languages. You build a dictionary…
But the vice versa is not true. You propose a theory based on Hindi. It is language specific. It doesn’t count for much. You propose an NLP technique for a particular problem. For Hindi. It is language specific. It doesn’t count for much. You build a software for some purpose. For Hindi. It is language specific. It doesn’t count for much.
That’s how it works in practice, if not theory. Or may be even in theory, with some help from the (very valid) idea of Universal Grammar (except that the UG may be the UG of English).
Even today I have got a review of a paper on a problem which is like one of the holy grails of NLP or CL. One of the comments is that the approach has been evaluated on Hindi so it can’t be compared to other techniques that already exist. True. But what is the number of papers published in the ‘first class’ NLP/CL conferences and journals in which the approach has been tried only on English? Doesn’t matter, because English is language independent. If you only evaluate your technique on English, that’s OK. But if you evaluate on only Hindi, that’s not acceptable. Because Hindi is language specific.
We know this very well in India. The Elite talks about (Indian) literature. And sometimes the Elite magnanimously (or dismissively) talks about (Indian) literature in languages. The first, of course, refers to literature in English. The second refers to literature in other languages. Indian languages.
The Elite talks of media. And the Elite (rarely and mostly negatively) talks of language media.
Hindi is a language. English is not a language.
Hindi is a language. English is the language.
English is above being merely a language.
That’s why all the work done in English is language independent. Not just research. Not just in NLP/CL. Anything. Movies, literature, music.
I am guilty of the sin of indulging too much in mere languages. I should be working mostly on English. Not just writing blog posts in English. Sometimes, of course, I can bestow a bit of my attention on languages. Like Hindi.
But I won’t do that. I will do the opposite. I am incurable.