• Paul CAPPEAU & Françoise GADET (Poitiers / Paris Ouest)
    Où en sont les corpus sur les français parlés?
    2007, Vol. XII-1, pp. 129-133
  • Paul CAPPEAU & Françoise GADET (Poitiers / Paris Ouest)
    The sociolinguistic exploitation of large corpora. Key-word and stone of wisdom
    2007, Vol. XII-1, pp. 99-110

    The desire to make use of large collections of oral data is nowadays largely shared by linguists. At a time when such tools are becoming increasingly available for French, it is important to make sure that there is sensitivity to all of those factors which guarantee reliability in the different stages of obtaining data: clarification of the term ‘corpus’; reflection on approaches to the field and to orality, and on representativeness (both in terms of genres and numbers of speakers); data elicitation practices and transcription.

  • Françoise GADET, Ralph LUDWIG, Lorenza MONDADA, Stefan PFÄNDER & Anne-Catherine SIMON (Paris Ouest / Halle, Allemagne / Bâle, Suisse / Fribourg, Allemagne / Louvain-la-Neuve, Belgique)
    A large corpus of spoken French : CIEL-F. Epistemological choices and empirical outcome
    2012, Vol. XVII-1, pp. 39-54

    This article presents the structure of the Corpus International Ecologique de la Langue Française, an extensive corpus of spoken French that will soon be available on the Internet, from both an epistemological and empirical perspective. Explanations are given with regard to the ideas that guided the data collection (ecological approach, comparability of the different areas of the Francophonie and communication situations) and to the choices made ("communicative spaces" and "activity types") with a view to relevant analyses in various research fields (variation, interaction, multimodality, French in contact, oral syntax) and an attempt is made to fill existing gaps in the current corpus. The article further addresses the issue of building up a network of experts, problems that had to be solved during fieldwork in the different areas and questions concerning standardisation, archiving and publication of the collected data (audio and video recordings, transcriptions, metadata), whereupon several examples are presented for comparative analyses.