Authorship Attribution in Turkish Texts
Keywords:
Authorship Attribution, Turkish, Forensic Linguistics, Authorship Analysis
Abstract
Recent developments in computer technology have created new ways of sharing information without limits of time and place. Computer technologies have not only made life easier and more accessible for users but have also opened a new arena for illegal activities. These illegal acts have found the opportunity to spread through e-mails, websites, internet chat rooms, forum pages and social networking sites (such as Facebook, Twitter and Instagram). Online participants do not need to give their real names, city of residence, age or gender in order to share their ideas, and this sense of anonymity encourages criminal activity. For this reason, cases of disputed authorship have become one of the main challenges of the technology age. This study is a corpus-based, simulated forensic authorship attribution case study on Turkish. The texts for the corpus were collected from Ekşi Sözlük (Sour Times), a collaborative online encyclopedia, and from Twitter. The corpus consists of 900 texts written by a total of 52 authors; of these, 105 texts belong to seven authors selected from Twitter. Two methodological approaches were applied, qualitative and statistical, following Grant (2013). Ten different tests were carried out, each reflecting parameters that may be encountered in real-world forensic cases. Accordingly, the roles of feature type, candidate author set size, text size, a limited number of texts per author and, finally, cross-genre application were tested. The analyses revealed that such a combined approach yields promising results for authorship attribution in Turkish in some of the tests. The findings show that there is potential for identifying unknown authors in Turkish, and the results appear to have important implications for wider applications of forensic authorship attribution techniques to Turkish texts.
References
Aarts, J. and Meijs, W. (1990). Theory and Practice in Corpus Linguistics. Amsterdam: Rodopi.
Abbasi, A. and Chen, H. (2005). Applying Authorship Analysis to Extremist-Group Web Forum Messages. IEEE Intelligent Systems, 20(5), pp.67-75.
Abbasi, A. and Chen, H. (2008). Writeprints: A Stylometric Approach to Identity-Level Identification and Similarity Detection. ACM Transactions on Information Systems, [online] 26(2), pp.1-29. Available at: https://dl.acm.org/citation.cfm?doid=1344411.1344413 [Accessed 16 Apr. 2018].
Agun, H., Yilmazel, S. and Yilmazel, O. (2017). Effects of language processing in Turkish authorship attribution. 2017 IEEE International Conference on Big Data (Big Data), [online] pp.1876-1881. Available at: http://doi.org/10.1109/BigData.2017.8258132 [Accessed 6 May 2018].
Akkoyunlu, B. and Soylu, M. (2011). Sosyal İletişim Ağları ve Dilin Yanlış Kullanımı Üzerine Nitel Bir Çalışma. İlköğretim Online, 10(2), pp.441-453.
Aksut, M., Batur, Z. and Avsar, T. (2006). Sanalca, Sanal Odalarda (İnternet) İletişim ve Türkçe. In: Akademik Bilişim Konferansı. [online] Pamukkale University. Available at: https://ab.org.tr/ab06/bildiri/23.doc [Accessed 14 Dec. 2018].
Alexa (2018). Keyword Research, Competitor Analysis, & Website Ranking. [online] Alexa.com. Available at: https://www.alexa.com/ [Accessed 7 Jan. 2018].
Amasyalı, M. and Diri, B. (2006). Automatic Turkish Text Categorization in Terms of Author, Genre and Gender. In: Kop C., Fliedl G., Mayr H.C., Métais E. (eds) Natural Language Processing and Information Systems. NLDB 2006. Lecture Notes in Computer Science, vol 3999. [online] Berlin, Heidelberg: Springer, pp.221-226. Available at: https://doi.org/10.1007/11765448_22 [Accessed 8 Feb. 2017].
Androutsopoulos, J. (2006). Introduction: Sociolinguistics and Computer-mediated Communication. Journal of Sociolinguistics, 10(4), pp.419-438.
Argamon, S. and Koppel, M. (2013). A Systemic Functional Approach to Automated Authorship Analysis. Journal of Law and Policy, 21(2), pp.299-316.
Argamon, S. and Levitan, S. (2005). Measuring The Usefulness of Function Words for Authorship Attribution. In: Proceedings of ACH/ALLC Conference. [online] University of Victoria, BC: Association for Computing and the Humanities, pp.1-3. Available at: https://pdfs.semanticscholar.org/1b70/57378e2a300cde88e6f291e146981d338a63.pdf [Accessed 6 Jan. 2018].
Argamon, S., Koppel, M., Fine, J. and Shimoni, A. (2003). Gender, Genre, and Writing Style in Formal Written Texts. Text - Interdisciplinary Journal for the Study of Discourse, [online] 23(3), pp.321-346. Available at: https://doi.org/10.1515/text.2003.014 [Accessed 12 Oct. 2017].
Argamon, S., Koppel, M., Pennebaker, J. and Schler, J. (2008). Automatically Profiling the Author of an Anonymous Text. Communications of the ACM, [online] 52(2), pp.119-123. Available at: http://doi.org/10.1145/1461928.1461959 [Accessed 8 Mar. 2018].
Argamon, S., Šarić, M. and Stein, S. (2003). Style Mining of Electronic Messages for Multiple Authorship Discrimination. In: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '03. [online] New York, NY: ACM, pp.475-480. Available at: http://doi.org/10.1145/956750.956805 [Accessed 25 Feb. 2018].
Argamon, S., Whitelaw, C., Chase, P., Hota, S., Garg, N. and Levitan, S. (2007). Stylistic text classification using functional lexical features. Journal of the American Society for Information Science and Technology, [online] 58(6), pp.802-822. Available at: https://doi.org/10.1002/asi.20553 [Accessed 8 Mar. 2018].
Aslan, C. (2007). Content Analysis On Language Mistakes Made By Turkish, Turkish Language and Literature Teachers in Internet. In: H. Uzunboylu and N. Çavuş, ed., 7. International Educational Technology Conference. Near East University, pp.90-98.
Baayen, H., van Halteren, H., Neijt, A. and Tweedie, F. (2002). An Experiment in Authorship Attribution. In: Proceedings of JADT 2002: Sixth International Conference on Textual Data Statistical Analysis. pp.29-37.
Baayen, H., van Halteren, H. and Tweedie, F. (1996). Outside the cave of shadows: using syntactic annotation to enhance authorship attribution. Literary and Linguistic Computing, [online] 11(3), pp.121-132. Available at: https://doi.org/10.1093/llc/11.3.121 [Accessed 7 Mar. 2018].
Babbie, E. (1989). The Practice of Social Research. 5th ed. Belmont, CA: Wadsworth Publishing Company.
Barber, A. (2004). Idiolects. In: E. Zalta, ed., Stanford Encyclopedia of Philosophy. [online] Palo Alto, CA: CSLI, University of Stanford. Available at: http://plato.stanford.edu/entries/idiolects/ [Accessed 14 Jan. 2018].
Baker, P. (2006). Using Corpora in Discourse Analysis. London: Continuum.
Barlow, M. (2010). Individual usage: a corpus-based study of idiolects. Paper presented at the 34th International LAUD Symposium, Landau, Germany. [online] Available at: http://michaelbarlow.com/barlowLAUD.pdf [Accessed 16 Oct. 2017].
Baron, N. (2003). Language and the Internet. In: A. Farghaly, ed., The Stanford Handbook for Language Engineers. Stanford, CA: CSLI, pp.59-127.
Barthes, R. (1977). The Death of the Author. In: Image, Music, Text: Essays Selected and Translated by Stephen Heath. New York: Hill and Wang, pp.142–148.
Barton, D. and Lee, C. (2013). Language Online: Investigating Digital Texts and Practices. Milton Park, Abingdon, Oxon: Routledge.
Bay, Y. and Çelebi, E. (2016). Feature Selection for Enhanced Author Identification of Turkish Text. In: O. Abdelrahman, E. Gelenbe, G. Gorbil and R. Lent, ed., Information Sciences and Systems 2015. Lecture Notes in Electrical Engineering, vol 363. [online] Cham: Springer, pp.371-379. Available at: https://doi.org/10.1007/978-3-319-22635-4_34 [Accessed 14 May 2018].
Baym, N. (2007). The new shape of online community: The example of Swedish independent music fandom. First Monday, [online] 12(8). Available at: https://doi.org/10.5210/fm.v12i8.1978 [Accessed 11 Jan. 2018].
Becker, A. (1984). Toward A Post-Structuralist View of Language Learning: A Short Essay. Language Learning, 33(5), pp.217-220.
Bell, M. (2007). The Transformation of The Encyclopedia: A Textual Analysis and Comparison of The Encyclopaedia Britannica and Wikipedia. Master’s thesis. Ball State University.
Bhargava, M., Mehndiratta, P. and Asawa, K. (2013). Stylometric Analysis for Authorship Attribution on Twitter. In: Big Data Analytics. BDA 2013. Lecture Notes in Computer Science, vol 8302. [online] Cham: Springer, pp.37-47. Available at: https://doi.org/10.1007/978-3-319-03689-2_3 [Accessed 14 May 2018].
Biber, D., Conrad, S. and Reppen, R. (1998). Corpus Linguistics: Investigating Language Structure and Use. Cambridge: Cambridge University Press.
Blanchard, A. (2004). Blogs as Virtual Communities: Identifying a Sense of Community in the Julie/Julia Project. Into the Blogosphere Articles, [online] Retrieved from the University of Minnesota Digital Conservancy. Available at: http://hdl.handle.net/11299/172837 [Accessed 10 Apr. 2017].
Bloch, B. (1948). A Set of Postulates for Phonemic Analysis. Language, 24(1), pp.3-46.
Bloomfield, L. (1933). Language. New York: Holt, Rinehart, and Winston.
Boukhaled, M., Frontini, F., Bourgne, G. and Ganascia, J. (2015). Computational Study of Stylistics: A Clustering-based Interestingness Measure for Extracting Relevant Syntactic Patterns. International Journal of Computational Linguistics and Applications, 6(1), pp.45–62.
Boutwell, S. (2011). Authorship Attribution of Short Messages Using Multimodal Features. Master's Thesis. Naval Postgraduate School.
boyd, d., Golder, S. and Lotan, G. (2010). Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter. In: 43rd Hawaii International Conference on System Sciences. pp.1-10.
Bozkurt, I., Baglioglu, O. and Uyar, E. (2007). Authorship Attribution: Performance of Various Features and Classification Methods. 22nd International Symposium on Computer and Information Sciences, ISCIS 2007 - Proceedings. [online] Available at: http://repository.bilkent.edu.tr/handle/11693/27016 [Accessed 23 Feb. 2018].
Britannica.com. (2014). Encyclopedia Britannica. [online] Available at: https://www.britannica.com/ [Accessed 13 Jan. 2017].
Burrows, J. (2002). 'Delta': a Measure of Stylistic Difference and a Guide to Likely Authorship. Literary and Linguistic Computing, 17(3), pp.267-287.
Cakir, H. and Topcu, H. (2006). Bir İletişim Dili Olarak İnternet. Erciyes Üniversitesi Sosyal Bilimler Enstitüsü Dergisi, 19(2), pp.71-96.
Can, F. and Patton, J. (2004). Change of Writing Style with Time. Computers and the Humanities, 38(1), pp.61-82.
Chaski, C. (2001). Empirical Evaluations of Language-based Author Identification Techniques. Forensic Linguistics, [online] 8(1), pp.1-65. Available at: http://citeseerx.ist.psu.edu/viewdoc/summary;jsessionid=C31C2DA74445ECBDC779719E20AF5359?doi=10.1.1.465.5651 [Accessed 2 Jan. 2018].
Chaski, C. (2005). Who’s at the Keyboard? Authorship Attribution in Digital Evidence Investigations. International Journal of Digital Evidence, 1(4), pp.1-14.
Chaski, C. (2007). The Keyboard Dilemma and Authorship Identification. In: P. Craiger and S. Shenoi, ed., Advances in Digital Forensics III. New York, NY: Springer, pp.133-146.
Chen, X., Hao, P., Chandramouli, R. and Subbalakshmi, K. (2011). Authorship Similarity Detection from Email Messages. In: P. Perner, ed., Machine Learning and Data Mining (MLDM) in Pattern Recognition. [online] Berlin, Heidelberg: Springer, pp.375-386. Available at: https://doi.org/10.1007/978-3-642-23199-5_28 [Accessed 7 Mar. 2018].
Cheng, E. (2013). Being Pragmatic About Forensic Linguistics. Journal of Law and Policy, 21(2), pp.541-550.
Coleman, S. (2006). E-mail, Terrorism, and the Right to Privacy. Ethics and Information Technology, [online] 8(1), pp.17-27. Available at: https://link.springer.com/article/10.1007%2Fs10676-006-9103-5 [Accessed 3 Feb. 2018].
Cotterill, J. (2010). How to use corpus linguistics in Forensic Linguistics? In: A. O'Keeffe and M. McCarthy, ed., The Routledge Handbook of Corpus Linguistics. London: Routledge, pp.578–590.
Coulthard, M. (1994). On the use of Corpora in the Analysis of Forensic Texts. International Journal of Speech Language and the Law, [online] 1(1), pp.27-43. Available at: https://journals.equinoxpub.com/index.php/IJSLL/article/view/16584 [Accessed 16 Feb. 2018].
Coulthard, M. (2004). Author Identification, Idiolect, and Linguistic Uniqueness. Applied Linguistics, [online] 25(4), pp.431-447. Available at: https://academic.oup.com/applij/article/25/4/431/193364 [Accessed 29 Apr. 2018].
Coulthard, M. (2005). Some Forensic Applications of Descriptive Linguistics. VEREDAS - Rev. Est. Ling. Juiz de Fora, [online] 9(1), pp.9-28. Available at: http://www.ufjf.br/revistaveredas/files/2009/12/artigo016.pdf [Accessed 8 Mar. 2018].
Coulthard, M. (2013). On Admissible Linguistic Evidence. Journal of Law and Policy, 21(2), pp.441–466.
Coulthard, M., Johnson, A. and Wright, D. (2017). An Introduction to Forensic Linguistics. Abingdon, Oxon: Routledge.
Cresswell, M. (2003). Heaps, Prototypes and Ethics: The Consequences of Using Judgements of Student Performance to Set Examination Standards in a Time of Change. London: Institute of Education, University of London.
Creswell, J. and Plano Clark, V. (2007). Designing and conducting mixed methods research. Thousand Oaks, CA: SAGE Publications.
Crystal, D. (2007). How Language Works. London: Penguin Books.
Crystal, D. (2008). Txtng: The gr8 db8. Oxford: Oxford University Press.
Crystal, D. (2011). Internet Linguistics: A Student Guide. London: Routledge.
Danet, B. (2001). Cyberpl@y: Communicating Online. Oxford: Berg.
de Vel, O., Anderson, A., Corney, M. and Mohay, G. (2001). Mining e-mail Content for Author Identification Forensics. ACM SIGMOD Record, [online] 30(4), pp.55-64. Available at: https://dl.acm.org/citation.cfm?doid=604264.604272 [Accessed 21 Jan. 2018].
Denzin, N. (1989). Interpretive Interactionism. Newbury Park: Sage.
Diederich, J. (2003). Authorship Attribution with Support Vector Machines. Applied Intelligence, 19(1/2), pp.109-123.
Diri, B. and Amasyali, M. (2003). Automatic Author Detection for Turkish Texts. In: Artificial Neural Networks and Neural Information Processing (ICANN/ICONIP). [online] pp.1-4. Available at: https://pdfs.semanticscholar.org/f142/2461024fcec79c94fe2671923ce79be0e4ef.pdf [Accessed 19 Nov. 2017].
Dogu, B., Ziraman, Z. and Ziraman, D. (2009). Web Based Authorship in the Context of User Generated Content, An Analysis of a Turkish Web Site: Eksi Sozluk. In: D. Riha and A. Maj, ed., The Real and the Virtual. Oxford: Inter-Disciplinary Press, pp.119-128.
Donath, J. and boyd, d. (2004). Public displays of connection. BT Technology Journal, 22(4), pp.71-82.
Dresner, E. and Herring, S. (2010). Functions of the Nonverbal in CMC: Emoticons and Illocutionary Force. Communication Theory, 20(3), pp.249-268.
Ebner, M., Lienhardt, C., Rohs, M. and Meyer, I. (2010). Microblogs in Higher Education – A chance to facilitate informal and process-oriented learning? Computers & Education, 55(1), pp.92-100.
Eden, S. and Heiman, T. (2011). Computer Mediated Communication: Social Support for Students with and without Learning Disabilities. Educational Technology & Society, [online] 14(2), pp.89–97. Available at: https://www.semanticscholar.org/ [Accessed 12 Feb. 2018].
Eder, M. (2013). Does size matter? Authorship attribution, small samples, big problem. Digital Scholarship in the Humanities, [online] 30(2), pp.167-182. Available at: https://doi.org/10.1093/llc/fqt066 [Accessed 7 Mar. 2018].
Eisenmann, M., O’Neil, J. and Geddes, D. (2013). Testing the Reliability of Metrics Proposed as Standards for Traditional Media Analysis. In: Proceedings from the 16th Annual International Public Relations Research Conference. [online] Available at: http://kdpaine.blogs.com/files/eisenmann-and-oneal_reliability0001.pdf [Accessed 10 Jan. 2018].
Ekinci, E. and Takci, H. (2012). Using Authorship Analysis Techniques in Forensic Analysis of Electronic Mails. 2012 20th Signal Processing and Communications Applications Conference (SIU 2012), pp.543-546.
ekşi sözlük. (2018). ekşi sözlük - kutsal bilgi kaynağı. [online] Available at: https://eksisozluk.com/ [Accessed 15 Jan. 2018].
Emigh, W. and Herring, S. (2005). Collaborative Authoring on the Web: A Genre Analysis of Online Encyclopedias. In: Proceedings of the 38th Annual Hawaii International Conference on System Sciences. [online] IEEE, pp.1-11. Available at: http://doi.org/10.1109/HICSS.2005.149 [Accessed 12 Mar. 2017].
Federal Committee (2017). Rule 702. Testimony by Expert Witnesses | Federal Rules of Evidence | LII / Legal Information Institute. [online] Federal Committee. Available at: https://www.law.cornell.edu/rules/fre/rule_702 [Accessed 23 Mar. 2017].
Fisher, B. and Fisher, D. (2012). Techniques of Crime Scene Investigation. Boca Raton, Fla.: CRC Press.
Fitzgerald, J. (2004). Using a Forensic Linguistic Approach to track the Unabomber. In: J. Campbell and D. Denivi, ed., Profilers: Leading Investigators take you Inside the Criminal Mind. New York: Prometheus Books, pp.193–222.
Fletcher, W. (2012). Corpus Analysis of the World Wide Web. The Encyclopedia of Applied Linguistics. [online] Available at: https://onlinelibrary.wiley.com/doi/abs/10.1002/9781405198431.wbeal0254 [Accessed 8 Mar. 2017].
Forsyth, R. and Holmes, D. (1996). Feature-finding for text classification. Digital Scholarship in the Humanities, 11(4), pp.163-174.
Foster, D. (2001). On the Trail of Anonymous. New York: Henry Holt & Company.
Freelon, D. (2013). ReCal OIR: Ordinal, Interval, and Ratio Intercoder Reliability as a Web Service. International Journal of Internet Science, 1(8), pp.10-16.
Gamon, M. (2004). Linguistic Correlates of Style: Authorship Classification with Deep Linguistic Analysis Features. In: COLING '04 Proceedings of the 20th international conference on Computational Linguistics. [online] Stroudsburg, PA: Association for Computational Linguistics, pp.611–617. Available at: http://doi.org/10.3115/1220355.1220443 [Accessed 17 Feb. 2018].
Gilquin, G. (2010). Corpus, Cognition and Causative Constructions. Amsterdam: Benjamins.
Gilquin, G. and Gries, S. (2009). Corpora and Experimental Methods: A State-of-the-art Review. Corpus Linguistics and Linguistic Theory, [online] 5(1), pp.1-26. Available at: https://www.degruyter.com/view/j/cllt.2009.5.issue-1/cllt.2009.001/cllt.2009.001.xml [Accessed 13 Jan. 2018].
Gimpel, K., Schneider, N., O’Connor, B., Das, D., Mills, D., Eisenstein, J., Heilman, M., Yogatama, D., Flanigan, J. and Smith, N. (2011). Part-of-speech tagging for Twitter: Annotation, features, and experiments. In: Proc. of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL HLT 2011). [online] Portland, USA: Association for Computational Linguistics, pp.42–47. Available at: http://www.aclweb.org/anthology/P11-2008 [Accessed 10 Jan. 2018].
Goffman, E. (1981). Forms of Talk. Philadelphia: University of Pennsylvania Press.
Göksel, A. and Kerslake, C. (2005). Turkish: A Comprehensive Grammar. London: Routledge.
Goldstein-Stewart, J., Winder, R. and Sabin, R. (2009). Person Identification from Text and Speech Genre Samples. In: Proceedings of the 12th Conference of the European Chapter of the ACL. [online] Athens, Greece: Association for Computational Linguistics, pp.336-344. Available at: http://www.aclweb.org/anthology/E09-1039 [Accessed 7 Feb. 2018].
Grant, T. (2004). Authorship Attribution in a Forensic Context. PhD thesis. University of Birmingham.
Grant, T. (2007). Quantifying Evidence in Forensic Authorship Analysis. International Journal of Speech Language and the Law, [online] 14(1), pp.1-25. Available at: https://journals.equinoxpub.com/index.php/IJSLL/article/view/3955 [Accessed 17 Apr. 2016].
Grant, T. (2008). Approaching Questions in Forensic Authorship Analysis. In: J. Gibbons and M. Turell, ed., Dimensions of Forensic Linguistics. Amsterdam: John Benjamins, pp.215–229.
Grant, T. (2010). Txt 4n6: Idiolect Free Authorship Analysis? In: M. Coulthard and A. Johnson, ed., The Routledge Handbook of Forensic Linguistics. London: Routledge, pp.508–522.
Grant, T. (2013). Txt 4N6: Method, Consistency and Distinctiveness in the Analysis of SMS Text Messages. Journal of Law and Policy, 21(2), pp.467–494.
Grant, T. and Baker, K. (2001). Identifying Reliable, Valid Markers of Authorship: A Response to Chaski. Forensic Linguistics, [online] 8(1), pp.66-79. Available at: https://journals.equinoxpub.com/index.php/IJSLL/article/view/1690 [Accessed 15 Jan. 2016].
Grant, T. and Macleod, N. (2016). Assuming Identities Online: Experimental Linguistics Applied to the Policing of Online Paedophile Activity. Applied Linguistics, 37(1), pp.50-70.
Grant, T. and Nini, A. (2013). Bridging the Gap between Stylistic and Cognitive Approaches to Authorship Analysis Using Systemic Functional Linguistics and Multidimensional Analysis. International Journal of Speech Language and the Law, [online] 20(2). Available at: https://journals.equinoxpub.com/index.php/IJSLL/article/view/13599 [Accessed 2 Feb. 2017].
Grieve, J. (2007). Quantitative Authorship Attribution: An Evaluation of Techniques. Literary and Linguistic Computing, [online] 22(3), pp.251-270. Available at: https://academic.oup.com/dsh/article/22/3/251/951481 [Accessed 3 Feb. 2018].
Halliday, M. and Hasan, R. (1976). Cohesion in English. London: Longman.
Halliday, M. and Matthiessen, C. (2004). An Introduction to Functional Grammar. 3rd ed. London: Routledge.
Hänlein, H. (1999). Studies in Authorship Recognition: A Corpus-based Approach (Vol. 352). Frankfurt am Main: Peter Lang.
Haug, M. and Baird, E. (2011). Finding the Error in Daubert. Hastings Law Journal, 62(3), pp.737-756.
Haylock, C. and Muscarella, L. (1999). Net Success: 24 Leaders in Web Commerce Show You How to Put the Internet to Work for Your Business. Holbrook, Mass.: Adams Media Corporation.
Herring, S. (1996). Linguistic and Critical Analysis of Computer-Mediated Communication: Some Ethical and Scholarly Considerations. The Information Society, [online] 12(2), pp.153-168. Available at: https://doi.org/10.1080/911232343 [Accessed 16 Feb. 2018].
Herring, S. (2001). Computer‐Mediated Discourse. In: D. Schiffrin, D. Tannen and H. Hamilton, ed., The Handbook of Discourse Analysis. Malden, MA: Blackwell Publishers Ltd, pp.612-634.
Herring, S. (2004). Computer-mediated Discourse Analysis: An Approach to Researching Online Communities. In: S. Barab, R. Kling and J. Gray, ed., Designing for Virtual Communities in the Service of Learning. New York: Cambridge University Press, pp.338-376.
Herring, S. (2004). Slouching Toward the Ordinary: Current Trends in Computer-Mediated Communication. New Media & Society, [online] 6(1), pp.26-36. Available at: http://journals.sagepub.com/doi/10.1177/1461444804039906 [Accessed 10 Mar. 2018].
Herring, S. (2007). A Faceted Classification Scheme for Computer-Mediated Discourse. Language@Internet, 4(2007), pp.1-37.
Herring, S., Johnson, D. and DiBenedetto, T. (1992). Participation in Electronic Discourse in A "Feminist" Field. In: Locating Power: Proceedings of the Second Berkeley Women and Language Conference. pp.250-262.
Herring, S., Scheidt, L., Bonus, S. and Wright, E. (2004). Bridging The Gap: A Genre Analysis of Weblogs. In: 37th Annual Hawaii International Conference on System Sciences. [online] IEEE. Available at: http://doi.org/10.1109/HICSS.2004.1265271 [Accessed 15 Feb. 2018].
Hillery, G. (1955). Definitions of Community: Areas of Agreement. Rural Sociology, 20(2), pp.111-123.
Hirst, G. and Feiguina, O. (2007). Bigrams of Syntactic Labels for Authorship Discrimination of Short Texts. Literary and Linguistic Computing, 22(4), pp.405-417.
Holmes, D. (1994). Authorship Attribution. Computers and the Humanities, 28(2), pp.87-106.
Honeycutt, C. and Herring, S. (2009). Beyond Microblogging: Conversation and Collaboration via Twitter. In: Proceedings of the Forty-Second Hawai’i International Conference on System Sciences (HICSS-42). [online] IEEE Press, pp.1-10. Available at: http://doi.org/10.1109/HICSS.2009.89 [Accessed 8 May 2018].
Hoover, D. (2004). Testing Burrows's Delta. Literary and Linguistic Computing, 19(4), pp.453-475.
Hornby, A. (2005). Oxford Advanced Learner's Dictionary. Oxford: Oxford University Press.
Howald, B. (2009). Authorship Attribution under the Rules of Evidence: Empirical Approaches in a Layperson's Legal System. International Journal of Speech Language and the Law, 15(2), pp.219-247.
Hunston, S. (2002). Corpora in Applied Linguistics. Cambridge: Cambridge University Press.
Hyland, K., Chau, M. and Handford, M. (2012). Corpus Applications in Applied Linguistics. 1st ed. London: Bloomsbury.
Iqbal, F., Binsalleeh, H., Fung, B. and Debbabi, M. (2010). Mining Writeprints from Anonymous e-mails for Forensic Investigation. Digital Investigation, 7(1-2), pp.56-64.
Jabbar, M. (2010). Overcoming Daubert's Shortcomings in Criminal Trials: Making the Error Rate the Primary Factor in Daubert's Validity Inquiry. New York University Law Review, 85(6), pp.2034-2064.
Jakobson, R. (1971). Word and Language. The Hague: Mouton.
Johnson, A. and Woolls, D. (2010). Who wrote this? The linguist as detective. In: S. Hunston and D. Oakey, ed., Introducing Applied Linguistics: Concepts and Skills. London: Routledge, pp.111–118.
Johnson, A. and Wright, D. (2014). Identifying Idiolect in Forensic Authorship Attribution: An n-gram Textbite Approach. Language and Law (Linguagem e Direito), 1(1), pp.37-69.
Johnson, S. (1997). Theorizing Language and Masculinity: A Feminist Perspective. In: S. Johnson and U. Meinhof, ed., Language and Masculinity. Oxford: Blackwell, pp.8–26.
Juola, P. (2006). Authorship Attribution for Electronic Documents. In: M. Olivier and S. Shenoi, ed., Advances in Digital Forensics II. New York, NY: Springer, pp.119-130.
Juola, P. (2008). Authorship attribution. Boston: NOW Publishing.
Juola, P. (2013). Stylometry and immigration: A case study. Journal of Law and Policy, 21(2), pp.287-298.
Juola, P. (2015). The Rowling Case: A Proposed Standard Analytic Protocol for Authorship Questions. Digital Scholarship in the Humanities, [online] pp.i100–i113. Available at: https://academic.oup.com/dsh/article/30/suppl_1/i100/363234 [Accessed 20 May 2018].
Kara, M. (2006). İnternet Türkçesinin Çığlığı: Türkçe Dili (!) ve Diğerleri. Akademik Araştırmalar Dergisi, 30, pp.157-170.
Katsuno, H. and Yano, C. (2007). Kaomoji and Expressivity in a Japanese Housewives’ Chat Room. In: B. Danet and S. Herring, ed., The Multilingual Internet: Language, Culture, and Communication Online. Oxford: Oxford University Press, pp.278-300.
Kaye, D. (2001). The Dynamics of Daubert: Methodology, Conclusions, and Fit in Statistical and Econometric Studies. Virginia Law Review, 87(8), pp.1933-2018.
Kennedy, G. (1998). An Introduction to Corpus Linguistics. London: Longman.
Kestemont, M., Luyckx, K., Daelemans, W. and Crombez, T. (2012). Cross-Genre Authorship Verification Using Unmasking. English Studies, 93(3), pp.340-356.
Kinkus, J. (2002). Science and Technology Resources on the Internet: Computer Security. Issues in Science & Technology Librarianship, [online] (36). Available at: http://doi.org/10.5062/F4QN64P2 [Accessed 17 Mar. 2018].
Kniffka, H. (2007). Working in Language and Law. Basingstoke: Palgrave Macmillan.
Komito, L. (1998). The Net as a Foraging Society: Flexible Communities. The Information Society, 14(2), pp.97-106.
Konda (2007). Toplumsal Yapı Araştırması 2006: Biz Kimiz? [online] Istanbul: Konda Araştırma ve Danışmanlık. Available at: http://konda.com.tr/wp-content/uploads/2017/02/2006_09_KONDA_Toplumsal_Yapi.pdf [Accessed 17 Mar. 2018].
Koppel, M. and Schler, J. (2003). Exploiting Stylistic Idiosyncrasies for Authorship Attribution. In: Proceedings of the 18th IJCAI Workshop on Computational Approaches to Style Analysis and Synthesis.
Koppel, M., Schler, J. and Argamon, S. (2009). Computational Methods in Authorship Attribution. Journal of the American Society for Information Science and Technology, [online] 60(1), pp.9-26. Available at: https://onlinelibrary.wiley.com/doi/abs/10.1002/asi.20961 [Accessed 7 May 2018].
Koppel, M., Schler, J. and Argamon, S. (2011). Authorship Attribution in the Wild. Language Resources and Evaluation, [online] 45(1), pp.83-94. Available at: https://link.springer.com/article/10.1007%2Fs10579-009-9111-2 [Accessed 1 Mar. 2018].
Koppel, M., Schler, J. and Argamon, S. (2013). Authorship Attribution: What’s easy and what’s hard? Journal of Law and Policy, 21(2), pp.317–331.
Koppel, M., Schler, J., Argamon, S. and Messeri, E. (2006). Authorship attribution with thousands of candidate authors. Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '06, [online] pp.659-660. Available at: https://dl.acm.org/citation.cfm?id=1148304 [Accessed 2 Apr. 2018].
Kotzé, E. (2010). Author Identification from Opposing Perspectives in Forensic Linguistics. Southern African Linguistics and Applied Language Studies, [online] 28(2), pp.185-197. Available at: https://doi.org/10.2989/16073614.2010.519111 [Accessed 17 Feb. 2018].
Kredens, K. (2002). Towards a Corpus-based Methodology of Forensic Authorship Attribution: A Comparative Study of Two Idiolects. In: B. Lewandowska-Tomaszczyk, ed., PALC’01: Practical Applications in Language Corpora. Frankfurt am Main: Peter Lang, pp.405–437.
Kredens, K. (2006). On the Status of Linguistic Evidence in Litigation. In: P. Nowak and P. Nowakowski, ed., Language, Communication, Information. Poznan: Sorus Publishers, pp.23-30.
Kredens, K. and Coulthard, M. (2012). Corpus Linguistics in Authorship Identification. In: P. Tiersma and L. Solan, ed., The Oxford Handbook of Language and Law. Oxford: Oxford University Press, pp.504–516.
Krippendorff, K. (2004). Reliability in Content Analysis: Some Common Misconceptions and Recommendations. Human Communication Research, 30(3), pp.411-433.
Kucukyilmaz, T., Cambazoglu, B., Aykanat, C. and Can, F. (2008). Chat Mining: Predicting User and Message Attributes In Computer-Mediated Communication. Information Processing & Management, [online] 44(4), pp.1448-1466. Available at: https://doi.org/10.1016/j.ipm.2007.12.009 [Accessed 16 Jan. 2018].
Larner, S. (2014). A Preliminary Investigation into the Use of Fixed Formulaic Sequences as a Marker of Authorship. International Journal of Speech Language and the Law, [online] 21(1), pp.1-22. Available at: https://journals.equinoxpub.com/index.php/IJSLL/article/view/15423 [Accessed 16 Nov. 2017].
Layton, R., Watters, P. and Dazeley, R. (2010). Authorship Attribution for Twitter in 140 Characters or Less. In: 2010 Second Cybercrime and Trustworthy Computing Workshop. pp.1-8.
Leech, G. (1992). Corpora and Theories of Linguistic Performance. In: J. Svartvik, ed., Directions in Corpus Linguistics: Proceedings of Nobel Symposium 82. Berlin: Mouton de Gruyter, pp.125–148.
Leech, G. (2006). A Glossary of English Grammar. Edinburgh: Edinburgh University Press.
Leonard, R. (2006). Forensic Linguistics: Applying the Scientific Principles of Language Analysis to Issues of the Law. International Journal of the Humanities, 3(7), pp.65-69.
Leonard, R., Ford, J. and Christensen, T. (2017). Forensic Linguistics: Applying the Science of Linguistics to Issues of the Law. Hofstra Law Review, 45(3), pp.881-897.
Lombard, M., Snyder-Duch, J. and Bracken, C. (2002). Content Analysis in Mass Communication: Assessment and Reporting of Intercoder Reliability. Human Communication Research, [online] 28(4), pp.587-604. Available at: https://doi.org/10.1111/j.1468-2958.2002.tb00826.x [Accessed 17 Oct. 2017].
López-Escobedo, F., Méndez-Cruz, C., Sierra, G. and Solórzano-Soto, J. (2013). Analysis of Stylometric Variables in Long and Short Texts. Procedia - Social and Behavioral Sciences, 95, pp.604-611.
Love, H. (2002). Authorship and Attribution. Cambridge: Cambridge University Press.
Lüdeling, A., Evert, S. and Baroni, M. (2006). Using Web Data for Linguistic Purposes. Language and Computers, 1(59), pp.7-24.
Luyckx, K. (2010). Scalability Issues in Authorship Attribution. PhD thesis. Proefschrift Universiteit Antwerpen.
Luyckx, K. and Daelemans, W. (2008). Using Syntactic Features to Predict Author Personality from Text. In: Proceedings of Digital Humanities 2008. [online] Available at: http://www.cnts.ua.ac.be/papers/2008/LD08dh.pdf [Accessed 20 Sep. 2017].
Luyckx, K. and Daelemans, W. (2010). The Effect of Author Set Size and Data Size in Authorship Attribution. Literary and Linguistic Computing, 26(1), pp.35-55.
McEnery, T. and Wilson, A. (2001). Corpus Linguistics: An Introduction. 2nd ed. Edinburgh: Edinburgh University Press.
MacLeod, N. (2010). Police Interviews with Women Reporting Rape: A Critical Discourse Analysis. PhD thesis. Aston University.
MacLeod, N. and Grant, T. (2012). Whose Tweet? Authorship Analysis of Micro-blogs and Other Short Form Messages. In: S. Tomblin, N. MacLeod, R. Sousa-Silva and M. Coulthard, ed., Proceedings of the Tenth International Association of Forensic Linguists' Biennial Conference. [online] Aston University, Birmingham, pp.210–224. Available at: http://www.forensiclinguistics.net [Accessed 8 Sep. 2016].
Madge, C. (2007). Developing a Geographers' Agenda for Online Research Ethics. Progress in Human Geography, [online] 31(5), pp.654-674. Available at: http://journals.sagepub.com/doi/10.1177/0309132507081496 [Accessed 2 Apr. 2017].
Madigan, D., Genkin, A., Lewis, D., Argamon, S., Fradkin, D. and Ye, L. (2005). Author Identification on the Large Scale. In: Proc. of Classification Society of N. America, 2005. pp.1-20.
McEnery, T. and Hardie, A. (2012). Corpus Linguistics: Method, Theory and Practice. Cambridge: Cambridge University Press.
McEnery, T. and Wilson, A. (1996). Corpus Linguistics: An Introduction. Edinburgh: Edinburgh University Press.
McEnery, T., Xiao, R. and Tono, Y. (2006). Corpus-Based Language Studies: An Advanced Resource Book. London: Routledge.
McMenamin, G. (2001). Style Markers in Authorship Studies. Forensic Linguistics, 8(2), pp.93-97.
McMenamin, G. (2002). Forensic Linguistics: Advances in Forensic Stylistics. Boca Raton, FLA.: CRC Press.
McMenamin, G. (2010). Forensic Stylistics. Theory and Practice of Forensic Stylistics. In: M. Coulthard and A. Johnson, ed., The Routledge Handbook of Forensic Linguistics. London: Routledge, pp.487–507.
Mendenhall, T. (1887). The Characteristic Curves of Composition. Science, 9(214), pp.237-249.
Mikros, G. and Perifanos, K. (2013). Authorship Attribution in Greek Tweets Using Author’s Multilevel N-Gram Profiles. In: AAAI Spring Symposium: Analyzing Microtext. [online] Association for the Advancement of Artificial Intelligence. Available at: https://www.aaai.org/ocs/index.php/SSS/SSS13/paper/download/5714/5914 [Accessed 17 May 2018].
Mingzhe, J. and Minghu, J. (2012). Text Clustering on Authorship Attribution Based on the Features of Punctuations Usage, Signal Processing (ICSP). In: Proceedings of the 11th International IEEE Conference on Digital Object Identifiers. IEEE, pp.217 – 2178.
Mischaud, E. (2007). Twitter: Expressions of the Whole Self. An investigation into user appropriation of a web-based communications platform. Master's thesis. London School of Economics and Political Science.
Mosteller, F. and Wallace, D. (1963). Inference in an Authorship Problem. Journal of the American Statistical Association, 58(302), pp.275-309.
Mosteller, F. and Wallace, D. (1964). Inference and Disputed Authorship: The Federalist. Reading, MA: Addison-Wesley Publishing Company Inc.
Nagarajan, M., Purohit, H. and Sheth, A. (2010). A Qualitative Examination of Topical Tweet and Retweet Practices. In: Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media. [online] pp.295-298. Available at: https://www.aaai.org/ocs/index.php/ICWSM/ICWSM10/paper/viewFile/1484/1880 [Accessed 16 Nov. 2018].
Nazar, R. and Sánchez Pol, M. (2007). An Extremely Simple Authorship Attribution System. In: M. Turell, J. Cicres and M. Spassova, ed., Proceedings of the 2nd European IAFL Conference on Forensic Linguistics / Language and the Law 2006. Barcelona: Documenta Universitaria.
Neuendorf, K. (2002). The Content Analysis Guidebook. London: Sage Publications.
Nini, A. (2015). Authorship Profiling in a Forensic Context. PhD thesis. Aston University.
Nini, A. (2018). An Authorship Analysis of the Jack the Ripper Letters. Digital Scholarship in the Humanities, [online] fqx065, pp.1-16. Available at: https://doi.org/10.1093/llc/fqx065 [Accessed 17 Mar. 2018].
NVivo. (2018). QSR International Pty Ltd, https://www.qsrinternational.com/nvivo/home.
Oldenburg, R. (1989). The Great Good Place. New York: Da Capo Press.
Olsson, J. (2004). Forensic Linguistics: An Introduction to Language, Crime, and the Law. London: Continuum.
Olsson, J. (2008). Forensic Linguistics. London: Continuum.
Overdorf, R. and Greenstadt, R. (2016). Blogs, Twitter Feeds, and Reddit Comments: Cross-domain Authorship Attribution. Proceedings on Privacy Enhancing Technologies, 2016(3), pp.155–171.
Owoputi, O., O'Connor, B., Dyer, C., Gimpel, K., Schneider, N. and Smith, N. (2013). Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters. In: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. [online] Association for Computational Linguistics, pp.390-391. Available at: http://repository.cmu.edu/cgi/viewcontent.cgi?article=1039&context=lti [Accessed 24 May 2018].
Oxburgh, G., Myklebust, T., Grant, T. and Milne, R. (2015). Communication in Investigative and Legal Settings. Communication in Investigative and Legal Contexts, [online] pp.1-13. Available at: https://doi.org/10.1002/9781118769133.ch1 [Accessed 8 Jan. 2018].
Polat, N. (2007). Linking Social Networks and Attainment in an L2 Accent: Kurds Acquiring Turkish. In: Proceedings of the Fifteenth Annual Symposium About Language and Society-Austin. [online] Texas Linguistic Forum 51, pp.144-153. Available at: http://salsa.ling.utexas.edu/proceedings/2007/Polat.pdf [Accessed 29 Jan. 2018].
Queralt Estevez, S. and Turell Julià, M. (2013). A Semi-automatic Authorship Attribution Technique Applied to Real Forensic Cases Involving Judgments in Spanish. In: R. Sousa-Silva, R. Faria, N. Gavaldà and B. Maia, ed., Bridging the Gap(s) between Language and the Law: Proceedings of the 3rd European Conference of the International Association of Forensic Linguists. Porto: Faculdade de Letras da Universidade do Porto, pp.10-18.
Rheingold, H. (1993). The Virtual Community. Reading, Mass.: Addison-Wesley.
Rico-Sulayes, A. (2011). Statistical Authorship Attribution of Mexican Drug Trafficking Online Forum Posts. International Journal of Speech Language and the Law, 18(1), pp.53–74.
RStudio (2011). RStudio: Integrated Development Environment for R. Boston, MA: RStudio, Inc.
Rudman, J. (1998). The State of Authorship Attribution Studies: Some Problems and Solutions. Computers and the Humanities, [online] 31(4), pp.351-365. Available at: https://doi.org/10.1023/A:100101862 [Accessed 10 Mar. 2017].
Sánchez Pol, M. (2005). A Stylometry-Based Method to Measure Intra and Inter-Authorial Faithfulness for Forensic Applications. In: S. Argamon, J. Karlgren and J. Shanahan, ed., Stylistic Analysis of Text for Information Access. [online] Kista: Swedish Institute of Computer Science, pp.11-15. Available at: https://pdfs.semanticscholar.org/d05f/32f6e28c2e20e4903ce9b9ff5b9d53d135b0.pdf [Accessed 11 Apr. 2018].
Santini, M. (2007). Characterizing Genres of Web Pages: Genre Hybridism and Individualization. In: Proceedings of the 40th Hawaii International Conference on System Sciences - 2007. [online] IEEE, pp.1-10. Available at: http://doi.org/10.1109/HICSS.2007.124 [Accessed 15 Apr. 2017].
Schmied, J. (1993). Qualitative and Quantitative Research Approaches to English Relative Constructions. In: C. Souter and E. Atwell, ed., Corpus-Based Computational Linguistics. Amsterdam: Rodopi, pp.85-96.
Schwartz, M. (2016). An Examination of Cross-Domain Authorship Attribution Techniques. CUNY Academic Works. [online] Available at: https://academicworks.cuny.edu/gc_etds/1573/ [Accessed 15 May 2018].
Schwartz, R., Tsur, O., Rappoport, A. and Koppel, M. (2013). Authorship Attribution of Micro-Messages. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. [online] Seattle, Washington, USA: Association for Computational Linguistics, pp.1880–1891. Available at: http://www.aclweb.org/anthology/D13-1193 [Accessed 9 Apr. 2018].
Scott, M. (2012). WordSmith Tools. Stroud: Lexical Analysis Software.
Shepherd, M. and Watters, C. (1998). The Evolution of Cybergenres. Proceedings of the Thirty-First Hawaii International Conference on System Sciences. [online] Available at: http://doi.org/10.1109/hicss.1998.651688 [Accessed 26 Jun. 2018].
Shuy, R. (2009). Ethical Questions in Forensic Linguistics: Introduction to Papers from a Linguistic Society of America Panel Presentation, San Francisco, California, January 9, 2009. International Journal of Speech Language and the Law, [online] 16(2), pp.219-226. Available at: https://journals.equinoxpub.com/index.php/IJSLL/article/view/6599 [Accessed 19 Jan. 2018].
Sinclair, J. (1988). Collins COBUILD Essential English Dictionary. London: HarperCollins.
Sinclair, J. (1991). Corpus, Concordance, Collocation. Oxford: Oxford University Press.
Sinclair, J. (2004). How to Use Corpora in Language Teaching. Amsterdam: John Benjamins.
Sixsmith, J. and Murray, C. (2001). Ethical Issues in the Documentary Data Analysis of Internet Posts and Archives. Qualitative Health Research, [online] 11(3), pp.423-432. Available at: https://doi.org/10.1177/104973201129119109 [Accessed 21 Feb. 2018].
Sketch Engine (2018). Sketch Engine | language corpus management and query system. [online] Sketchengine.eu. Available at: https://www.sketchengine.eu/ [Accessed 12 May 2018].
Smith, D., Spencer, S. and Grant, T. (2009). Authorship Analysis for Counter Terrorism. Unpublished Research Report. QinetiQ/Aston University.
Solan, L. and Tiersma, P. (2005). Speaking of Crime: The Language of Criminal Justice. Chicago: University of Chicago Press.
Solan, M. (2013). Intuition versus Algorithm: The Case of Forensic Authorship Attribution. Journal of Law and Policy, 21(2), pp.551-576.
Sousa Silva, R., Laboreiro, G., Sarmento, L., Grant, T., Oliveira, E. and Maia, B. (2011). ‘twazn me!!! ;(’ Automatic Authorship Analysis of Micro-Blogging Messages. In: R. Muñoz, A. Montoyo and E. Métais, ed., Proceedings of the 16th International Conference on Natural Language Processing and Information Systems. NLDB 2011. Lecture Notes in Computer Science, vol 6716. [online] Berlin, Heidelberg: Springer, pp.161–168. Available at: https://doi.org/10.1007/978-3-642-22327-3_16 [Accessed 13 Jan. 2018].
Sousa-Silva, R., Sarmento, L., Grant, T., Oliveira, E. and Maia, B. (2010). Comparing Sentence-Level Features for Authorship Analysis in Portuguese. In: T. Pardo, A. Branco, A. Klautau, R. Vieira and V. de Lima, ed., Computational Processing of the Portuguese Language. PROPOR 2010. Lecture Notes in Computer Science, vol 6001. Berlin, Heidelberg: Springer, pp.51-54.
Spassova, M. and Turell, M. (2007). The Use of Morphosyntactically Annotated Tag Sequences as Markers of Authorship. In: M. Turell, J. Cicres and M. Spassova, ed., Proceedings of the 2nd European IAFL Conference on Forensic Linguistics / Language and the Law 2006. Barcelona: Documenta Universitaria.
Stamatatos, E. (2007). Author Identification Using Imbalanced and Limited Training Texts. [online] pp.237-241. Available at: http://doi.org/10.1109/dexa.2007.5 [Accessed 19 Dec. 2017].
Stamatatos, E. (2008). Author Identification: Using Text Sampling to Handle the Class Imbalance Problem. Information Processing & Management, 44(2), pp.790-799.
Stamatatos, E. (2009). A Survey of Modern Authorship Attribution Methods. Journal of the American Society for Information Science and Technology, [online] 60(3), pp.538-556. Available at: https://doi.org/10.1002/asi.21001 [Accessed 5 May 2018].
Stamatatos, E. (2013). On the Robustness of Authorship Attribution Based on Character N-Gram Features. Journal of Law & Policy, 21(2), pp.421-439.
Stamatatos, E., Fakotakis, N. and Kokkinakis, G. (2000). Automatic Text Categorization in Terms of Genre and Author. Computational Linguistics, [online] 26(4), pp.471-495. Available at: https://doi.org/10.1162/089120100750105920 [Accessed 8 Mar. 2018].
Stamatatos, E., Fakotakis, N. and Kokkinakis, G. (2001). Computer-Based Authorship Attribution Without Lexical Measures. Computers and the Humanities, [online] 35(2), pp.193-214. Available at: https://doi.org/10.1023/A:1002681919510 [Accessed 23 May 2017].
Stubbs, M. (2005). Conrad in the Computer: Examples of Quantitative Stylistic Methods. Language and Literature, [online] 14(1), pp.5-24. Available at: https://doi.org/10.1177/0963947005048873 [Accessed 29 Mar. 2018].
Swales, J. (1990). Genre Analysis: English in Academic and Research Settings. Cambridge: Cambridge University Press.
Takci, H. and Ekinci, E. (2012). Character Level Authorship Attribution for Turkish Text Documents. The Online Journal of Science and Technology-TOJSAT, 2(3), pp.12-16.
Tas, T. and Gorur, A. (2007). Author Identification for Turkish Texts. Cankaya University Journal of Arts and Sciences, 1(7), pp.151-161.
Teknomo, K. (2015). Similarity Measurement. [online] People.revoledu.com. Available at: https://people.revoledu.com/kardi/tutorial/Similarity/ [Accessed 24 Oct. 2018].
Temur, T. and Vuruş, N. (2009). İnternet (Genel Ağ) Ortamında Türkçe’nin Kullanımına İlişkin Bir Çözümleme. Balıkesir Üniversitesi Sosyal Bilimler Enstitüsü Dergisi, [online] 12(22), pp.232-244. Available at: http://sbe.balikesir.edu.tr/dergi/edergi/c12s22/makale/c12s22m16.pdf [Accessed 2 Feb. 2017].
Tereszkiewicz, A. (2013). Genre Analysis of Online Encyclopedias: The Case of Wikipedia. Krakow: Jagiellonian University Press.
The Law Commission (2011). Expert Evidence in Criminal Proceedings in England and Wales. [ebook] London: The Stationery Office. Available at: https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/229043/0829.pdf [Accessed 13 Jan. 2018].
Thurlow, C., Lengel, L. and Tomic, A. (2004). Computer Mediated Communication. London: Sage.
Tiersma, P. (2010). The Origins of Legal Language. In: L. Solan and P. Tiersma, ed., Oxford Handbook of Language and Law. [online] Loyola-LA Legal Studies Paper No. 2009-45, pp.1-26. Available at: https://ssrn.com/abstract=1695226 [Accessed 9 Mar. 2018].
Tiersma, P. and Solan, L. (2002). The Linguist on the Witness Stand: Forensic Linguistics in American Courts. Language, 78(2), pp.221-239.
Tinsley, H. and Weiss, D. (2000). Interrater Reliability and Agreement. In: H. Tinsley and S. Brown, ed., Handbook of Applied Multivariate Statistics and Mathematical Modeling, 1st ed. New York: Academic Press, pp.94-124.
Tognini-Bonelli, E. (2001). Corpus Linguistics at Work. Amsterdam: John Benjamins.
Tomblin, S. (2013). To Cut a Long Story Short: An Analysis of Formulaic Sequences in Short Written Narratives and Their Potential as Markers of Authorship. PhD thesis. Aston University.
Turell, M. (2011). The Use of Textual, Grammatical and Sociolinguistic Evidence in Forensic Text Comparison. International Journal of Speech Language and the Law, [online] 17(2), pp.211-250. Available at: https://journals.equinoxpub.com/index.php/IJSLL/article/view/6409 [Accessed 14 Jan. 2018].
Turell, M. and Gavaldà, N. (2013). Towards an Index of Idiolectal Similitude (or Distance) in Forensic Authorship Analysis. Journal of Law and Policy, 21(2), pp.495–514.
Turell, M. and Rosso, P. (2013). Computational Approaches to Plagiarism Detection and Authorship Attribution in Real Forensic Cases. In: R. Sousa-Silva, R. Faria, N. Gavaldà and B. Maia, ed., Bridging the Gap(s) between Language and the Law: Proceedings of the 3rd European Conference of the International Association of Forensic Linguists. Porto: Faculdade de Letras da Universidade do Porto, pp.19-30.
Turkish Criminal Procedure Code (2009). Criminal codes - Legislationline. [online] Legislationline.org. Available at: http://www.legislationline.org/documents/section/criminal-codes/country/50 [Accessed 11 Mar. 2016].
Türkoğlu, F., Diri, B. and Amasyalı, M. (2007). Author Attribution of Turkish Texts by Feature Mining. In: D. Huang, L. Heutte and M. Loog, ed., Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues. ICIC 2007. Lecture Notes in Computer Science, vol 4681. [online] Berlin, Heidelberg: Springer, pp.1086–1093. Available at: https://doi.org/10.1007/978-3-540-74171-8_110 [Accessed 16 Apr. 2018].
Twitter Blog (2017). English (US). [online] Blog.twitter.com. Available at: https://blog.twitter.com/ [Accessed 6 Jan. 2017].
Ustunova, K. (2002). Türkçede Yapı Kavramı ve Söz Dizimi İncelemeleri. Bursa: Uludağ Üniversitesi Basımevi.
van Halteren, H. (2004). Linguistic Profiling for Author Recognition and Verification. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics - ACL '04. [online] Association for Computational Linguistics. Available at: http://doi.org/10.3115/1218955.1218981 [Accessed 11 Feb. 2018].
van Halteren, H. (2007). Author Verification by Linguistic Profiling. ACM Transactions on Speech and Language Processing, 4(1), pp.1-17.
Wellman, B. and Gulia, M. (1999). Net Surfers Don't Ride Alone: Virtual Communities as Communities. In: B. Wellman, ed., Networks in the Global Village. Boulder, Colo.: Westview Press, pp.331-366.
Wikicount.net. (2018). How many articles are there on Wikipedia? - Wikipedia article count. [online] Available at: http://wikicount.net/ [Accessed 15 Feb. 2018].
Wikipedia. (2018). Wikipedia, the free encyclopedia. [online] Available at: https://www.wikipedia.org/ [Accessed 15 Feb. 2018].
Williams, K. (2000). Reproduced and Emergent Genres of Communication on the World Wide Web. The Information Society, 16(3), pp.201-215.
Wright, D. (2013). Stylistic Variation within Genre Conventions in the Enron Email Corpus: Developing a Text-sensitive Methodology for Authorship Research. International Journal of Speech Language and the Law, [online] 20(1). Available at: https://journals.equinoxpub.com/index.php/IJSLL/article/view/10595 [Accessed 10 May 2018].
Wright, D. (2014). Stylistics versus Statistics: A Corpus Linguistic Approach to Combining Techniques in Forensic Authorship Analysis Using Enron Emails. PhD thesis. University of Leeds.
Wright, D. (2017). Using Word N-Grams to Identify Authors and Idiolects. International Journal of Corpus Linguistics, [online] 22(2), pp.212-241. Available at: http://www.jbe-platform.com/content/journals/10.1075/ijcl.22.2.03wri [Accessed 9 May 2018].
Yaman, H. and Erdogan, Y. (2007). İnternet Kullanımının Türkçeye Etkileri: Nitel Bir Araştırma. Journal of Language and Linguistic Studies, 3(2), pp.237-249.
Yannikos, Y., Graner, L., Steinebach, M. and Winter, C. (2014). Data Corpora for Digital Forensics Education and Research. In: G. Peterson and S. Shenoi, ed., Advances in Digital Forensics X. DigitalForensics 2014. IFIP Advances in Information and Communication Technology, vol 433. [online] Berlin, Heidelberg: Springer, pp.309-325. Available at: http://doi.org/10.1007/978-3-662-44952-3_21 [Accessed 9 Dec. 2017].
Yates, J. and Orlikowski, W. (1992). Genres of Organizational Communication: A Structurational Approach to Studying Communication and Media. Academy of Management Review, 17(2), pp.299-326.
Yates, J. and Orlikowski, W. (2002). Genre Systems: Structuring Interaction Through Communicative Norms. Journal of Business Communication, 39(1), pp.13-35.
Yule, G. (1939). On Sentence-Length as a Statistical Characteristic of Style in Prose: With Application to Two Cases of Disputed Authorship. Biometrika, [online] 30(3-4), pp.363-390. Available at: https://doi.org/10.1093/biomet/30.3-4.363 [Accessed 20 Mar. 2017].
Zappavigna, M. (2012). The Discourse of Twitter and Social Media: How we Use Language to Create Affiliation on the Web. London: Continuum.
Zheng, R., Li, J., Chen, H. and Huang, Z. (2006). A Framework for Authorship Identification of Online Messages: Writing-Style Features and Classification Techniques. Journal of the American Society for Information Science and Technology, [online] 57(3), pp.378-393. Available at: https://doi.org/10.1002/asi.20316 [Accessed 16 Nov. 2017].
Zheng, R., Qin, Y., Huang, Z. and Chen, H. (2003). Authorship Analysis in Cybercrime Investigation. In: H. Chen, R. Miranda, D. Zeng, C. Demchak, J. Schroeder and T. Madhusudan, ed., Intelligence and Security Informatics. ISI 2003. Berlin, Heidelberg: Springer, pp.59-73.
Zipf, G. (1932). Selected Studies of the Principle of Relative Frequency in Language. Cambridge, MA: Harvard University Press.
License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.