Summer 2015 Projects at The LINGUIST List

It has been a busy summer here at The LINGUIST List! Please take a moment to check out the projects that our 2015 summer interns and volunteers have been working on!

  Alec      Clara

Edvard Bikbaev

Edvard Bikbaev works on the GORILLA project at the LINGUIST List. To that end, he is creating and annotating the speech corpus for Russian, his native language. The speech corpus Edvard is involved with includes multiple annotated tiers and will be further used to train a forced aligner.  In addition, Edvard translates contents of the GORILLA website, and updates MultiTree with linguistic publications in Russian. Edvard plans to apply for a PhD program in Computations Linguistics and use the Russian speech corpus he has created at Linguist List for his
dissertation.

Alec Wolyniec

Alec spends most of his time at the LINGUIST List creating the official LINGUIST List Google Chrome App, which will soon provide easy access to the upcoming GeoLing map and other LINGUIST List resources. He is also in the process of writing a script that automatically collects language data from Wiktionary and other open-source databases, and has so far used the program to extend the LINGUIST List’s Yiddish lexicon.

Clara García Gómez

Clara is mainly involved in the GORILLA Project creating a speech corpus for Castilian Spanish, of which she is a native speaker. She is creating materials necessary for automatic alignment and transcription. She also works on the translation of parts of the website into Spanish and in some editing tasks for LINGUIST List. She is interested in the study of undocumented languages so she is happy to participate in GORILLA and hope to contribute to this project further after creating the corpus for Castilian Spanish.

Jacob Henry

Jacob has spent most of his time working on the LL-MAP project, a large collection of maps containing linguistic  and geographic information to be used by linguists, anthropologists, and other researchers.The LINGUIST List relocation Indiana University became an opportunity to relaunch and redesign the technologies. This has involved porting all of the data accumulated to new servers and testing various file formats to find the easiest to work with for our purposes. We’ve made some progress and ideally, we would be able to relaunch LL-MAP by the end of the summer.

Seyed Asghari

Seyed started working on Baharlu dialect of south Azeri Turkic language. It is a language that is being spoken in west Iran with the neighboring area of Persian, Kurdish, and Lori languages. He studied different writing styles used to produce the most suitable transcriptions. Moreover, he needed to study the standards of romanization of Baharlu Turkic. He worked on sample recordings, creating transcription, romanization, and translation.

During this work he has also started preparing a Baharlu-English dictionary that including original word, romanization, English translation and will be completed with other elements such as lemma, PoS and pronunciation information.

Petar Garžina

For the last two weeks, Petar has been mainly working on the Automatic Speech Recognition Project. Currently, he is working on the Croatian speech corpus and ASR. The first part of the project consists of making recordings and transcribing them. Along with building the corpus, he has been going through the documentation about Chrome Apps, and from the beginning of this week, he will start working alongside Alec on the LINGUIST List Chrome app. At the end of his internship, he would like to have a working Croatian Speech Recognizer, and an application that will ease the use of various LINGUIST List features.

Zac Branson

Zac has been working primarily on the front and back end of Geoling which can be found at geoling.linguistlist.org. Zac has additionally contributed to the Gorilla project (gorilla.linguistlist.org) including the development of resources to be provided by Gorilla.

Hilaria Cruz at LINGUIST List

We at the LINGUIST List are always happy to collaborate with fellow scholars on our projects. We were lucky to host Dr. Hilaria Cruz, a researcher and speaker of Chatino, for a week while she worked on creating a spoken corpus of the language for an ongoing project. If you’re interested in collaborating on spoken corpora with us, please contact us!

Dr. Cruz at LINGUIST List

Hilaria Cruz is a linguist and a native speaker of San Juan Quiahije (SJQ) Chatino, an endangered Zapotecan language, spoken in the mountains of Oaxaca, Mexico. She has been documenting and revitalizing the Chatino languages since 2003. Hilaria founded the Chatino Language Documentation Project (CLDP) together with her sister Emiliana Cruz (now an assistant professor at UMass Amherst), and their advisor Tony Woodbury of The University of Texas at Austin.

The CLDP aims to carry out linguistic documentation projects and research integrating the advancement of linguistic science with the wishes of the Chatino people to promote and honor their language. During the course of Hilaria’s fieldwork on Chatino, she has personally collected and archived more than one hundred hours of audio recordings of naturalistic speech in formal and informal settings.

Hilaria earned her Ph.D. in linguistics in 2014 at the University of Texas at Austin. The dissertation entitled “Linguistic Poetics and Rhetoric of Eastern Chatino of San Juan Quiahije,” analyzes the poetic patterns of SJQ discourse.

Hilaria is currently working on a project with LINGUIST List to create tools for speech recognition in SJQ Chatino. Beginning in the fall of 2015 Hilaria will be a Lyman T. Johnson Postdoctoral fellow at the University of Kentucky. There Hilaria will investigate, the Chatino concepts of death in four Eastern Chatino communities. They are Santa Maria Yolotepec (YOL), Santa Maria Amialtepec (AMIA) and San Juan Quiahije (SJQ) and San Marcos Zacatepec (ZAC).  Hilaria’s research interests include Chatino poetics and verbal art, language revitalization, and automatic speech recognition in Chatino.

A Visitor to LINGUIST List

In the normal course of running The LINGUIST List, we are occasionally lucky enough to receive visitors. Last week, Dr. Francis M. Tyers, a post doc in computational linguistics at The University of Tromsø stopped by the office to discuss various computational projects. Dr. Tyers is in town collaborating with local linguists as well enjoying the abundant sunlight – surely a treat coming from the Arctic Circle!

Dr. Tyers (center) with Andrew Lamont (left) and Jonathan Washington (right)

Dr. Tyers has been involved in the field of machine translation for nine years, he completed his PhD at the Universitat d’Alacant, and now works as a postdoctoral researcher at UiT Norgga árktalaš universitehta. He has published over 30 articles related to machine translation and computational linguistics. He is secretary of both the ISCA SIG on Speech and Language Technology for Minority Languages (SaLTMiL) and the Apertium project. His research interests include finite-state morphological analysis, rule-based disambiguation and machine translation for marginalised and lesser-resourced languages. He is currently in Bloomington working on dependency parsing for Kazakh.

Should you be on the IU campus in Bloomington, we would love to meet you. Please come and see us!

2015 Summer Interns and Volunteers

We at LINGUIST List are delighted this summer to open our doors to the 2015 Summer Interns! If you are interested in becoming an intern, be on the look-out for our application cycle to open again next spring. In the mean time, there are other ways to get involved here at LINGUIST List. Just contact us for more information.

Take a look below to meet the newest members of the LINGUIST List:

Seyed Asghari

Seyed Amir Hossein Asghari is a doctoral candidate in the Near Eastern Languages and Cultures department at the University of Indiana, Bloomington. He has been the project manager for the first Persian-Albanian Dictionary (2010) and co-author of Persian-Albania and English conversation (2008).

He is currently working at Baharli South Azeri Turkish of Iran at Linguistic List.

Zac Branson

blog_pic

Zac recently completed his second year in the PhD program in the Department of Linguistics at Indiana University. He is pursuing coursework for an M.A. in Linguistics and an M.S. in Computational Linguistics. Zac’s research interests include the documentation of understudied and endangered languages, and the development of computational tools to aid such documentation.

Petar Garzina

Petar comes from Zadar, Croatia where he is currently finishing an MA at the Department of Linguistics at the University of Zadar. His fields of interest are computational linguistics, particularly natural language processing, and authorship attribution. During his free time he regularly works out (crossfit), when the weather is nice he also spear fishes and when the weather is bad he likes to watch a good movie, or play online video games.

Jacob Henry

Jacob is currently an intern at the LINGUIST List for summer 2015. His main projects include the relaunch of the LL-Map website as well as assisting with the launch of the GORILLA site. He’s originally from Muncie, Indiana and in 2011 he became a student at the University of Oklahoma where he’s currently pursuing a BA in French and General Linguistics. His particular academic interests lie in sociolinguistics, historical linguistics, and typology. He has also done research in various periods of French literature, as well as interning in a microbiology lab.

 

Umida Khikmatillaeva

 

 

Umida started volunteering for the Linguist List in the Spring of 2015. During 2012-2014 she worked at IU for the Turkish Flagship Program; her task was creating Turkish to Uzbek Bridge project materials. Prior to this program, she worked at the Center for Turkic and Iranian Lexicography and Dialectology (CTILD). Together with her colleagues, they created an Uzbek-English online dictionary. She has been working for IU since 1996 and taught Intermediate level Uzbek at Summer Workshop in Slavic, East European and Central Asian Languages (SWEESL) till 2003, then she coordinated Advanced Uzbek Program (Summer Overseas Program) in Samarkand in 2004.

 

Levi King

Levi joined LINGUIST List as a volunteer for the summer of 2015, where he’s contributing to the GORILLA project and related speech recognition work. He’s currently a Ph.D. student in Computational Linguistics (CL) at Indiana University, where he previously got a dual M.A. in CL (Department of Linguistics) and Applied Linguistics & TESOL (Teaching English to Speakers of Other Languages; Department of Second Language Studies). While his CL interests run the gamut, he’s particularly interested in applying natural language processing to the automatic content analysis of non-native speaker language, and more broadly, the analysis of “noisy” language data in general. In his free time, he enjoys live music, board and video games, trivia, pottery and comic books.

Alec Wolyniec

Photo on 6-22-15 at 7.20 PM #2

Alec is an intern at the LINGUIST List for the summer of 2015. His work includes assisting with the development of the LL-Map website, updating databases to be used in the development of Automatic Speech Recognition technologies, and creating algorithms to scrape language data from Wiktionary and other websites. Originally from the suburbs of New York City, he is currently a student at Emory University in Atlanta, where he is pursuing a BS in Computer Science and a secondary major in Linguistics. In his spare time, Alec enjoys jazz music, reading, basketball, and board games.

Fund Drive 2015 is Over! Thank You to All of Our Supporters!

Dear LINGUIST List Subscribers, Supporters and Friends,

Fund Drive 2015 has come to a close, and we would like to thank everyone who made a donation—big or small—for their generosity! This Fund Drive was no small effort, and we appreciate all of the support that we received from
nearly 700 donors!

To wrap up Fund Drive 2015, we would like to conclude with the results of our challenges:

– The Subfield Challenge

While phonology and syntax held the lead for the first half of the drive, it was computational linguistics that came out on top! Check out how the top five subfields ranked:

1. Computational Linguistics ($7,790)
2. Syntax ($6,121)
3. Sociolinguistics ($3,998)
4. Phonology ($3,129)
5. Semantics ($2,845)

– The University Challenge

Indiana University Bloomington and The University of Washington were in a stiff battle throughout the drive, but who ended up on top? Check out the results:

1. Indiana University Bloomington ($2,730)
2. University of Washington ($2,590)
3. Stanford University ($1,365)
4. North-West University, Potchefstroom and Vaal Triangle Campuses, South Africa ($820)
5. University of Arizona ($750)

– The Business Challenge

We also received some donations from a few very generous businesses:

1. Google Inc. ($4,000)
2. Microsoft Natural Language Group ($300)
3. IBM Watson ($150) and IBM Context Computing ($150)

We would also like to make a special mention of our Advisory Panel, whose efforts during our Advisors’ Challenge and all throughout the drive were invaluable. Not only were their donations vital to Fund Drive, but their willingness to spread the word and raise awareness brought great life to our efforts. We send them our sincerest thanks!

We are incredibly grateful for each and every donation that we received totaling $41,091.85, but we still did not come close to our goal of $79,000. Although Fund Drive 2015 is over, you can continue supporting us with one-time or recurrent donations by selecting The Linguist List Discretionary Fund (see the Instructions page):

https://www.myiu.org/one-time-gift

Please consider making a donation to keep The LINGUIST List running the way you like it! LINGUIST List is dedicated to freely providing information and services to the linguistic community, and it’s through your support that we’re able to do it.

We thank you all for your support during Fund Drive 2015!

Best wishes,
The LINGUIST List Team

Win a Copy of The Cambridge Handbook of Generative Syntax!

Dear LINGUIST List Readers,

For today’s prize, we are giving away a copy of “The Cambridge Handbook of Generative Syntax” donated by Cambridge University Press.  Read more about the book here:

http://goo.gl/sCyinf

Valued at $150, this book could be yours for a donation of $50 or more!  Enter to win by donating before noon tomorrow (May 7, 11:59 AM EST).  The lucky winner will be selected at random for pool of donors.

This is a limited time offer so to be eligible to win, donate today! You can donate at the link below:

http://goo.gl/e656LG

In addition to the one-time donations to our Fund Drive, you can also become a recurring donor and support LINGUIST List on a long-term basis.  Find out how by following this link:

http://goo.gl/Q27jls

Thanks and good luck!

Linguistically yours,

The LINGUIST List Crew

One-Day Giveaway: Donate Now to Win Pragmatics and Prosody in English Language Teaching from Springer!

Dear LINGUIST List Readers,

For today’s prize, we are giving away a copy of Pragmatics and Prosody in English Language Teaching, edited by Jesús Romero-Trillo and published by Springer. You can view a summary of the book here:

https://goo.gl/9R8Z7H

Valued at $179, it can yours for at least a $50 donation.  Donate tomorrow before noon (April 28, 11:59 AM EST) to be eligible to win.  You can donate at the link below:

http://goo.gl/e656LG

In addition to the one-time donations to our Fund Drive, you can also become a recurring donor and support LINGUIST List on a long-term basis.  Find out how by following this link:

http://goo.gl/Q27jls

Please spread the word about our Fund Drive.  You can do so by liking, sharing, and retweeting on social media.  Every bit of support helps and is vital to the continuing existence of LINGUIST List.

Thanks and good luck!

Linguistically yours,

The LINGUIST List Crew

 

Limited-time Giveaway: Win a Copy of The Rise of Writing from Cambridge University Press!

Dear LINGUIST List Readers,

For our Fund Drive raffle today, Cambridge University Press is donating two copies of The Rise of Writing, by Deborah Brandt.  Read the book summary here:

http://goo.gl/8lP5nD

Donate before midnight tomorrow (May 2, 11:59 pm EST), and two of you could be the lucky winners!  You can donate by following the link below:

http://goo.gl/e656LG

In addition to the one-time donations to our Fund Drive, you can also become a recurring donor and support LINGUIST List on a long-term basis.  Find out how by following this link:

http://goo.gl/Q27jls

Please be sure to spread the word about our Fund Drive – like, share, retweet.  We appreciate all of your support!

Thanks and good luck!

Linguistically yours,

The LINGUIST List Crew

 

 

Donate Today and Win a One-Year Subscription to Linguistics Vanguard by De Gruyter Mouton!

Dear LINGUIST List Readers,

For today’s prize, we are giving away two one-year online subscriptions to the new multimodal journal Linguistics Vanguard from De Gruyter Mouton.  You can read more about the journal here:

http://goo.gl/yu2UgH

Normally valued at $149, a one-year subscription can be yours for a donation of at least $50!  Donate before noon tomorrow (May 1, at 11:59 AM EST) to be entered to win.  Donate at this link here:

http://goo.gl/e656LG

In addition to the one-time donations to our Fund Drive, you can also become a recurring donor and support LINGUIST List on a long-term basis.  Find out how by following this link:

http://goo.gl/Q27jls

Thanks and good luck!

Linguistically yours,

The LINGUIST List Crew

One-day Giveaway: Win a Copy of The Body in Language from Brill!

Dear LINGUIST List Readers,

For today’s prize, we are giving away a copy of “The Body in Language” donated by Brill, a great resource for anyone interested in cognitive linguistics, lexical semantics or language and culture.  Read more about the book here:

http://goo.gl/zDemHn

Valued at $163, this book could be yours for a donation of $50 or more!  Enter to win by donating before noon tomorrow (April 30, 11:59 AM EST).  The lucky winner will be selected at random for pool of donors.

This is a limited time offer so to be eligible to win, donate today! You can donate at the link below:

http://goo.gl/e656LG

In addition to the one-time donations to our Fund Drive, you can also become a recurring donor and support LINGUIST List on a long-term basis.  Find out how by following this link:

http://goo.gl/Q27jls

Thanks and good luck!

Linguistically yours,

The LINGUIST List Crew