Corpus del Español Actual (CEA)

Home > Corpus-linguistics, Spanish, websites > Corpus del Español Actual (CEA)

Corpus del Español Actual (CEA)

2012/04/26 plagwitz Leave a comment Go to comments

Link:
Example of KWIC view result:

Based on Europarl, Wikicorpus (2006!), MultiUN. From their metadata page:

Metadata for Corpus del Español Actual
Corpus name	Corpus del Español Actual
CQPweb’s short handles for this corpus	cea / CEA
Total number of corpus texts	73,010
Total words in all corpus texts	539,367,886
Word types in the corpus	1,680,309
Type:token ratio	0 types per token
Text metadata and word-level annotation
The database stores the following information for each text in the corpus:	There is no text-level metadata for this corpus.
The primary classification of texts is based on:	A primary classification scheme for texts has not been set.
Words in this corpus are annotated with:	Lemma (Lemma)
	Part-Of-Speech (POS)
	WStart (WStart)
The primary tagging scheme is:	Part-Of-Speech
Further information about this corpus is available on the web at:	http://sfn.uab.es:9080/SFN/tools/cea/english

To use, “consult the IMS’s brief description of the regular-expression syntax used by the CQP and their list of sample queries. If you wish to define your query in terms of grammatical and inflectional categories, you can use the part-of-speech tags listed on the CEA’s Corpus Tags page.”
Also provides frequency data (based on word forms or lemmas, and others – up to a 1000):
Examples of a frequency query result (click for full-size image. Note that a lemmatized list was requested here which links all inflected forms back to the lemma, and vice versa, upon clicking the lemma, displays a KWIC view containing all forms subsumed under that lemma, see picture above):

Categories: Corpus-linguistics, Spanish, websites Tags: links

Comments (0) Trackbacks (0) Leave a comment Trackback

No comments yet.

No trackbacks yet.

Leave a comment Cancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Calendar ICS corruption How a teacher can use Sanako voice insert to easily add spoken comments to students’ Sanako oral proficiency exams

Questions? Read the About. Or just ask me a quick Our Databases: Resources with calendars -- Language learning material Moodle Sites, multimedia files -- films
FAQs for LRC student staff or for students or for teachers. To search our FAQs, in the browser addressbar, add after "https://plagwitz.wordpress.com/feed/?tag=faqs+/" "+TAG1" (from tag cloud below) OR "https://plagwitz.wordpress.com/feed/tag=faqs
&category_name=" "CAT1" (from category hierarchy below). OR search both categories and tags, and multiple TAGs/CATs (connect with "," for OR-search, with "+" for AND-search), like so: https://plagwitz.wordpress.com/feed/?tag=TAG1+TAG2+...TAGn&category_name=CAT1
+CAT2+...CATn"
Other ways to find help

If you cannot find it here, look there: 5,500 Language-Learning Links and Programs for learning or teaching 150 languages
Shortcuts:Our Lists, Our Maps, LRC Staff Moodle Site,LRC Project Moodle Site, 49erexpress, UNCC Moodle, Student Recordings: s:claslcslrcsanakostudent
Learning usage samples: Sanako oral exam, Kaltura webcam presentation, Dictation with speech recognition, Sanako written exam, Chinese and Japanese interactive stroke-order practice
Test the Sanako Installer, Webbrowser Popup Konfigurator for XP, or Windows7, faster LRC TeacherPC Log-in Let MS facilitate diacritics writing by installing for you US-International keyboard layout
This is my personal blog (Google+). The views expressed on these pages are mine alone and not those of my employer. The information in this weblog is provided “AS IS” with no warranties, and confers no rights.

Thomas' Work Space

Corpus del Español Actual (CEA)

Leave a comment Cancel reply

Blog Stats

Thank you for your response. ✨

Top Posts & Pages

Top Clicks

Categories

Email Subscription

Archives

Top

Thomas' Work Space

Corpus del Español Actual (CEA)

Share this:

Related

Leave a comment Cancel reply

Blog Stats

Thank you for your response. ✨

Top Posts & Pages

Top Clicks

Categories

Email Subscription

Archives

Top