Ball, P. (2000). In P. Ball, H. F. Spirer, & L. Spirer (Eds.), Making the Case: Investigating Large Scale Human Rights Violations Using Information Systems and Data Analysis. AAAS.
Belin, T. R., & Rubin, D. B. (1995). A method for calibrating false-match rates in record linkage. Journal of the American Statistical Association, 90(430), 694–707.
Bilenko, M., & Mooney, R. J. (2003). Adaptive Duplicate Detection Using Learnable String Similarity Measures. In KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automatic Record Linkage Using Seeded Nearest Neighbour and Support Vector Machine Classification. In KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A survey of indexing techniques for scalable record linkage and deduplication. IEEE Transactions on Knowledge and Data Engineering, 24(9), 1537–1555.
Cohen, W., Ravikumar, P., & Fienberg, S. (2003). A comparison of string metrics for matching names and records. In KDD workshop on data cleaning and object consolidation (Vol. 3, pp. 73–78).
Copas, J., & Hilton, F. (1990). Record linkage: Statistical models for matching computer records. Journal of the Royal Statistical Society, Series A, 153(3), 287–320.
Dai, A. M., & Storkey, A. J. (2011). The grouped author-topic model for unsupervised entity resolution. In Artificial neural networks and machine learning–ICANN 2011 (pp. 241–249). Springer.
Fortini, M., Liseo, B., Nuccitelli, A., & Scanu, M. (2001). On Bayesian Record Linkage. Research in Official Statistics, 4(1), 185–198.
Gutman, R., Afendulis, C., & Zaslavsky, A. (2013). A Bayesian procedure for file linking to analyze end-of-life medical costs. Journal of the American Statistical Association, 108(501), 34–47.
Hsu, W., Lee, M. L., Liu, B., & Ling, T. W. (2000). Exploration Mining in Diabetic Patients Databases: Findings and Conclusions. In KDD ’00 (pp. 430–436). ACM.
A split-merge Markov chain Monte Carlo procedure for the Dirichlet process mixture model.
Jewell, N. P., Spagat, M., & Jewell, B. L. (2013). MSE and Casualty Counts: Assumptions, Interpretation, and Challenges. In T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Counting Civilian Casualties: An Introduction to Recording and Estimating Nonmilitary Deaths in Conflict. Oxford, UK: Oxford University Press.
Larsen, M. D. (2002). Comments on Hierarchical Bayesian Record Linkage. In Proceedings of the joint statistical meetings, section on survey research methods (pp. 1995–2000). The American Statistical Association.
Larsen, M. D. (2005). Advances in Record Linkage Theory: Hierarchical Bayesian Record Linkage Theory. In Proceedings of the joint statistical meetings, section on survey research methods (pp. 3277–3284). The American Statistical Association.
Larsen, M. D., & Rubin, D. B. (2001). Iterative automated record linkage using mixture models. Journal of the American Statistical Association, 96(453), 32–41.
Lum, K., Price, M. E., & Banks, D. (2013). Applications of Multiple Systems Estimation in Human Rights Research. The American Statistician, 67(4), 191–200.
Marchant, N. G., Steorts, R. C., Kaplan, A., Rubinstein, B. I. P., & Elazar, D. N. (2019). d-blink: Distributed end-to-end Bayesian entity resolution.
McCallum, A., & Wellner, B. (2004). Conditional Models of Identity Uncertainty with Application to Noun Coreference. In Advances in neural information processing systems (NIPS ’04) (pp. 905–912). MIT Press.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A Domain-Specific Tool for the Deduplication of Vaccination History Records in Childhood Immunization Registries. Computers and Biomedical Research, 33(2), 126–143.
Murphy, J., Brackbill, R. M., Thalji, L., Dolan, M., Pulliam, P., & Walker, D. J. (2007). Measuring and Maximizing Coverage in the World Trade Center Health Registry. Statistics in Medicine, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic record linkage and deduplication after indexing, blocking, and filtering. Journal of Privacy and Confidentiality, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. M., Axford, S. J., & James, A. P. (1959). Automatic linkage of vital records: Computers can be used to extract "follow-up" statistics of families from files of routine records. Science, 130(3381), 954–959.
Sadinle, M. (2014). Detecting Duplicates in a Homicide Registry Using a Bayesian Partitioning Approach. Annals of Applied Statistics, 8(4), 2404–2434.
Sariyar, M., Borg, A., & Pommerening, K. (2012). Active Learning Strategies for the Deduplication of Electronic Patient Data Using Classification Trees. Journal of Biomedical Informatics, 45(5), 893–900.
Steorts, R. C., Hall, R., & Fienberg, S. E. (2016). A Bayesian Approach to Graphical Record Linkage and Deduplication. Journal of the American Statistical Association, 111(516), 1660–1672.
Tancredi, A., & Liseo, B. (2011). A hierarchical Bayesian approach to record linkage and population size problems. Annals of Applied Statistics, 5(2B), 1553–1585.