TITLE: JANA Corpus: Part 1 v1.0 (Arabic Human-Human Dialogues Corpus) Authors: Abdelrahim Abdelsabour Elmadany, Sherif M. Abdou, Mervat Gheith INTRODUCTION JANA is a multi-genre corpus of Arabic dialogues labeled for Arabic Dialogues Language Understanding (ADLU) at the utterance level. We collected dialogues from different genre call centers such as Banks, Flights, and Mobile Network providers; these dialogues consist of transcribed phone calls and instant messages for inquiries regarding providing service from call centers. In this release, we provide annotated data consist of approximately 3001 turns with average 6.7 words per turn, contains 4725 utterances with average 4.3 words per utterance, and 20311 words SOURCE DATA JANA corpus consist of two categories first, spontaneous spoken dialogues second, IM dialogue. Indeed, our corpus is the only one, which have the two categories. To the best of our knowledge, all of the DAs corpora have only one category. The first category, the spoken dialogues has collected from Banks and Flight call-centers; it has recorded since August 2013 and contains 52 phone calls recorded from Egyptian’s banks and Egypt Air Company call-centers with an average duration of two hours of talking time after removing ads from calls. It consists of human-human discussions about providing services e.g. Create new bank account, service request, balance check and flight reservation. The second category, IM dialogues, which contain 30 chat dialogues, collected from the mobile network operator’s online-support ‘KSA Zain, KSA Mobily, and KSA STC’. PUBLICATION Elmadany, A.A., Abdou, S.M., Gheith, M., 2015a. JANA: An Arabic Human-Human Dialogues Corpus, Recent Trends in Information Systems (ReTIS), 2015 IEEE 2nd International Conference on. IEEE, Jadavpur University, Kolkata, India, pp.347-352. you can download the paper at http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=7232903&abstractAccess=no&userType=inst CONTACTS AbdelRahim A. Elmadany Department of Computer Science Institute of Statistical Studies and Research (ISSR), Cairo University, Egypt ar_elmadany@hotmail.com Sherif M. Abdou Department of Information Technology Faculty of Computers and Information, Cairo University, Egypt sh.ma.abdou@gmail.com Mervat Gheith Department of Computer Science Institute of Statistical Studies and Research (ISSR), Cairo University, Egypt. mervat_gheith@yahoo.com