Title : Readme file for the Kasdi-Merbah University Emotional Database in Arabic Speech (KEDAS)

Author(s) :---------------------------------------------------
Mourad Belhadj 
Ilham Bendellali
Elalia Lakhdari


KEDAS is an acted audio database of emotional speech, intended for speech emotion recognition systems.
It was recorded in Modern Standard Arabic (ISO 639-3 code [arb]) and was developed over two months of work.
This work is issued by the Faculty of Modern Technologies for Information and Communication of the University of Kasdi Merbah, Ouargla, Algeria.
Written consent was obtained from all participants prior to registration.

Participants gave informed consent for the collection of their personal data (name, surname, and age). They were informed that this personal information would not be
published and would be deleted, that only their audio recordings would be kept, and that the recordings would be
used for scientific research only.

-------------------------------------------------------------
Linguistic material:
____________________________________________________________
• Stop now                                         توقف الآن • 
____________________________________________________________
• Please act politely                         تصرف بأدب من فضلك•
____________________________________________________________
• I am so sorry                                    أنا آسف حقا•
____________________________________________________________
• I know this is hard                           أعلم أن هذا صعب•
____________________________________________________________
• Be careful, it will fall                      إحذر سيسقط منك•
____________________________________________________________
• Is there another solution?                    آلا يوجد حل آخر•
____________________________________________________________
• This is really great                            هذا رائع حقا•
____________________________________________________________
• Thank you very much                            شكرا جزيلا لك•
____________________________________________________________
• The second office is on the right         المكتب الثاني على اليمين • 
____________________________________________________________
• What can I do for you?                            تفضل •
____________________________________________________________

Overview info :

__________________________________________________________________________
 Dataset name            |  Kasdi-Merbah Emotional Database in Arabic Speech (KEDAS)
__________________________________________________________________________
 Date                    |  June 2022
__________________________________________________________________________
 Dataset type            |  Acted
__________________________________________________________________________
 File type               |  Audio only
__________________________________________________________________________
 Format                  |  .FLAC
__________________________________________________________________________
 Sampling rate           |  44 kHz*
__________________________________________________________________________
 No. of speakers         |  500
__________________________________________________________________________
 No. of female speakers  |  254
__________________________________________________________________________
 No. of male speakers    |  246
__________________________________________________________________________
 Age of speakers         |  [20-70 years]
__________________________________________________________________________
 No. of emotional states |  5 emotions [sadness, fear, anger, happiness, neutral]
__________________________________________________________________________
 No. of statements       |  10 = [9 sentences + 1 word]
__________________________________________________________________________
 No. of audio clips      |  5000 audio files
__________________________________________________________________________
 Size                    |  282 MB
__________________________________________________________________________
 Duration of each clip   |  [0.5 s to 2.5 s]
__________________________________________________________________________
 Hardware, software      |  Zoom H8 device, Audacity audio editor
__________________________________________________________________________
 Total duration          |  02:04
__________________________________________________________________________
 Accessibility           |  Public
__________________________________________________________________________


*With the exception of Actor 309, whose recordings are sampled at 48 kHz.
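
The figures in the overview table can be cross-checked with a few lines of arithmetic. The sketch below is only a sanity check, not part of the dataset tooling. It assumes that the 02:04 duration entry is the total length of the database in hours:minutes, which this readme does not state explicitly; the variable names are illustrative.

```python
# Values copied from the overview table.
n_speakers, n_female, n_male = 500, 254, 246
n_clips = 5000
clip_min_s, clip_max_s = 0.5, 2.5

# ASSUMPTION: "02:04" means 2 hours and 4 minutes of audio in total.
total_s = 2 * 3600 + 4 * 60

# Female and male counts should add up to the total number of speakers.
speakers_ok = (n_female + n_male == n_speakers)

# The mean clip length implied by the total should lie in the stated range.
mean_clip_s = total_s / n_clips
duration_ok = clip_min_s <= mean_clip_s <= clip_max_s

print(speakers_ok, round(mean_clip_s, 3), duration_ok)
```

Under that assumption the implied mean clip length is about 1.5 s, which falls inside the stated range of 0.5 s to 2.5 s.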

All audio files in the database are named according to the following coding scheme.
The name of each file is divided into seven parts, in the following order:

(1)- Actor id:        Actor number, from first to last [001 to 500]
---------------------------------------------------------------
(2)- Gender:          m=male, f=female
---------------------------------------------------------------
(3)- Age group:       [01]=20-30 * [02]=30-40 *  [03]=40-50 * (years)
----------------------------------------------------------------
(4)- Emotion:         E1=anger, E2=sadness, E3=fear, E4=happiness, E5=neutral
----------------------------------------------------------------
(5)- Statement:       [01]=first statement, [02]=second statement.
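
The naming scheme above can be decoded programmatically. The following is a minimal sketch, not an official tool: the function name parse_kedas_name, the underscore separator, and the example file name are all assumptions made for illustration, since this readme does not specify how the parts are joined. Only the five documented fields are decoded; any remaining parts are returned unchanged, and codes not listed above are passed through as-is.

```python
from pathlib import Path

# Code tables copied from the naming scheme above.
GENDERS = {"m": "male", "f": "female"}
AGE_GROUPS = {"01": "20-30", "02": "30-40", "03": "40-50"}
EMOTIONS = {"E1": "anger", "E2": "sadness", "E3": "fear",
            "E4": "happiness", "E5": "neutral"}


def parse_kedas_name(filename, sep="_"):
    """Decode the five documented fields of a KEDAS-style file name.

    ASSUMPTION: the parts are joined by `sep`; the real separator may differ.
    """
    # Drop the directory and the extension, then split into parts.
    parts = Path(filename).stem.split(sep)
    if len(parts) < 5:
        raise ValueError(f"expected at least 5 parts, got {len(parts)}")
    actor, gender, age, emotion, statement = parts[:5]
    return {
        "actor_id": int(actor),
        "gender": GENDERS.get(gender.lower(), gender),
        # Unknown codes fall back to the raw code instead of failing.
        "age_group": AGE_GROUPS.get(age, age),
        "emotion": EMOTIONS.get(emotion.upper(), emotion),
        "statement": int(statement),
        # Parts beyond the five documented ones are kept untouched.
        "extra": parts[5:],
    }


# Hypothetical file name, used only to exercise the function.
info = parse_kedas_name("001_m_01_E1_01.flac")
print(info["actor_id"], info["gender"], info["age_group"], info["emotion"])
```

If the actual files use a different separator, pass it through the sep argument, for example parse_kedas_name(name, sep="-").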

ACKNOWLEDGEMENTS

The Directorate-General for Scientific Research and Technological Development (DGRSDT) and The Thematic Agency for Research in Science and Technology (ATRST) supported this work. We acknowledge their funding and support. The Linguistic Research Unit and Arabic Language Issues in Algeria (URLCLAA, Ouargla, Algeria) are acknowledged for their contribution to studying emotions in the KEDAS Dataset. The Artificial Intelligence and Information Technologies Laboratory (LENATI, University Kasdi Merbah Ouargla) is acknowledged for its support, direction, and advice.