"share and discuss ideas, promote your research and discover the digital arts and humanities"

group: Text mining

This group originated from the Methods Network workshop Text Mining for Historians, organised by AHDS History and the Association for History and Computing UK (AHC-UK) and building upon the successful Methods Network Workshop on Historical Text Mining held in Lancaster in July 2006. We are interested in text mining tools and methods, linguistic analysis and corpus methods.

Methods Network audio: Dawn Archer - Love - a familiar or a devil? An Exploration of Key Domains in Shakespeare's Comedies and Tragedies


00:29:49 minutes (27.31 MB)

This is an audio recording of the presentation 'Love - a familiar or a devil? [read more...]

Methods Network audio: Paul Baker - "The question is, how cruel is it?" Keywords in Debates on Fox Hunting in the British House of Commons


00:23:43 minutes (21.72 MB)

This is an audio recording of the presentation '"The question is, how cruel is it?" Keywords in Debates on Fox Hunting in the British House of Commons' given by Paul Baker, University of Lancaster, UK, at the Methods Network expert seminar on linguistics: Word Frequency and Keyword Extraction (Lancaster University, 8 Sep 20 [read more...]

Methods Network audio: Christian Kay - Issues for Historical Corpora: First Catch Your Word


00:27:07 minutes (24.84 MB)

This is an audio recording of the presentation 'Issues for Historical Corpora: First Catch Your Word' given by Christian Kay, University of Glasgow, Scotland, at the Methods Network expert seminar on linguistics: Word Frequency and Keyword Extraction (Lancaster University, 8 Sep 2005).

Methods Network audio: David Hoover - Word Frequency, Statistical Stylistics, and Authorship Attribution


00:30:32 minutes (27.83 MB)

This is an audio recording of the presentation 'Word Frequency, Statistical Stylistics, and Authorship Attribution' given by David Hoover, New York University, USA, at the Methods Network expert seminar on linguistics: Word Frequency and Keyword Extraction (Lancaster University, 8 Sep 2005).

Methods Network audio: John Kirk - Word Frequency: Use or Misuse


00:29:18 minutes (26.83 MB)

This is an audio recording of the presentation 'Word Frequency: Use or Misuse' given by John Kirk, Queen's University Belfast, Northern Ireland, at the Methods Network expert seminar on linguistics: Word Frequency and Keyword Extraction (Lancaster University, 8 Sep 2005).

event: AACL 2008 - American Association for Corpus Linguistics

12/03/2008 - 12:00
15/03/2008 - 19:00
Etc/GMT-7

Invited speakers (alphabetical order):

Harald Baayen, University of Alberta (Canada)
Doug Biber, Northern Arizona University (United States)
Laurel Brinton, University of British Columbia (Canada)
Susan Hunston, University of Birmingham (UK)
Tony McEnery, Lancaster University (UK)

Place: Brigham Young University, Provo, Utah, USA
Website: http://corpus.byu.edu/aacl2008

event: A Virtual Research Environment for the Study of Documents and Manuscripts (Charles Crowther)

17/08/2007 - 16:30
17/08/2007 - 18:30
Etc/GMT

The scholar interpreting an ancient documentary text has a broad range of relevant electronic tools available; but the interaction is largely in one direction and the experience is fragmented by the dispersal of the electronic resources. [read more...]

video: Theorizing from Data (Peter Norvig) at Google Developers Day US

'"It is a capital mistake to theorize before one has data." Sir Arthur Conan Doyle's words from 1891 remain true today. Researchers in computational linguistics and information retrieval now have a million times more data than was available 30 years ago. This talk explores what this data can do for problems in language understanding, translation, information extraction, and inference, and extrapolates to what more data may bring in the future.' [read more...]

forum: Corpus linguistics, text mining, textual analysis, data mining

Quote:

Multiple Text Datasets
Corpora
Corpus linguistics
Text Mining
Textual Analysis
Data Mining

[read more...]

Methods Network blog: Text mining

Two weeks ago I was in Glasgow, discussing Text Mining for Historians. The workshop started with a couple of presentations that gave a more general introduction to the field, describing specific projects, tools or concepts such as corpus linguistics. [read more...]

forum: Workshop Materials

Attached are handouts and presentations given at the workshop Text Mining for Historians. [read more...]

  • Mark Greengrass: "Data Extraction Across Multiple Text Datasets for Arts and Humanities Research"
  • Dawn Archer: "Keywords and key domains ... in the Trial of 'The Rugeley Poisoner' (William Palmer)"

event: Can computers ever read ancient texts?

03/08/2007 - 16:30
03/08/2007 - 18:30
Etc/GMT

Digital Classicist/ICS Work in Progress Seminar, Summer 2007 - Melissa Terras (University College London) [read more...]
