Inferring and revising theories with confidence: analyzing bilingualism in the 1901 Canadian census

From National Research Council Canada

Download	View accepted manuscript: Inferring and revising theories with confidence: analyzing bilingualism in the 1901 Canadian census (PDF, 569 KiB)
DOI	Resolve DOI: https://doi.org/10.1080/08839510500313711
Author	Search for: Drummond, Chris; Search for: Matwin, S.; Search for: Gaffield, C.
Format	Text, Article
Conference	Proceedings of Applied Artificial Intelligence, 2006
Abstract	This paper shows how machine learning can help in analyzing and understanding historical change. Using data from the Canadian census of 1901, we discover the influences on bilingualism in Canada at the beginning of the last century. The discovered theories partly agree with and partly complement the existing views of historians on this question. Our approach, based around a decision tree, not only infers theories directly from data but also evaluates existing theories and revises them to improve their consistency with the data. One novel aspect of this work is the use of confidence intervals to determine which factors are both statistically and practically significant, and thus contribute appreciably to the overall accuracy of the theory. When inducing a decision tree directly from data, confidence intervals determine when new tests should be added. If an existing theory is being evaluated, confidence intervals also determine when old tests should be replaced or deleted to improve the theory. Our aim is to minimize the changes made to an existing theory to accommodate the new data. To this end, we propose a semantic measure of similarity between trees and demonstrate how this can be used to limit the changes made.
Publication date	2006
Publisher	Taylor and Francis
In	Applied Artificial Intelligence 20, no. 1: 1–33.
Language	English
NRC number	NRCC-47437
NPARC number	8913286
Export citation	Export as RIS
Report a correction	Report a correction (opens in a new tab)
Record identifier	2dbd6a74-1a51-4f64-8e48-fd1d7613f506
Record created	2009-04-22
Record modified	2020-04-22

Date modified:: 2024-07-27