About
Łukasz is a Data Scientist with 8+ years of experience in various ML projects (social media monitoring, call center's transcriptions analysis, recommendation engines, information extraction from texts, legal texts analysis, and many more). He is finishing a PhD in Machine Learning, related to aspect-based sentiment analysis at the Wroclaw University of Science and Technology, Poland. He received MSc in Computer Science from the Wroclaw University of Technology in 2013 with distinction. He also received an MA in Law from Wroclaw University in 2014, and he is still actively interested in the legal aspects of IT and analysis of legal documents using NLP.
Research Interests
Education
PhD in Computer Science, Artificial Intelligence
Wroclaw University of Science and Technology
Science - Management - Commercialization
Cambridge University, UK · 2015
Master in Law
Wroclaw University · 2014
MsC in Computer Science (excellent grade)
Wroclaw University of Science and Technology · 2013
Publications
Why Aren't We NER Yet? Artifacts of ASR Errors in Named Entity Recognition
2023 · ACL 2023
Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark
2023 · NeurIPS 2023 (Datasets)
Electoral agitation data set: the use case of the Polish election
2022 · *LREC 2022 : Workshop Language Resources and Evaluation Conference : 20-25 June 2022 : First Workshop on Natural Language Processing for Political Sciences (PoliticalNLP) : proceedings*
Assessment of massively multilingual sentiment classifiers
2022 · *WASSA 2022 : The 12th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis, Proceedings of the Workshop, May 26, 2022.*
Fact-checking: relevance assessment of references in the Polish political domain
2021 · *Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 25th International Conference KES 2021*
Comprehensive analysis of aspect term extraction methods using various text embeddings
2021 · *Computer Speech and Language*
Political advertising dataset: the use case of the Polish 2020 presidential elections
2020 · *The Fourth Widening Natural Language Processing : Workshop Program of the Workshop, July 5, 2020, Seattle, USA : ACL 2020*
Return on investment in machine learning: crossing the chasm between academia and business
2020 · *Foundations of Computing and Decision Sciences*
WER we are and WER we think we are
2020 · *Findings of the Association for Computational Linguistics, Findings of ACL: EMNLP 2020, 16-20, November, 2020*
Punctuation prediction in spontaneous conversations: can we mitigate ASR errors with retrofitted word embeddings?
2020 · *Interspeech 2020 : 21th Annual Conference of the International Speech Communication Association, 25-29 October 2020, Shanghai, China*
Aspect detection using word and char embeddings with (Bi)LSTM and CRF
2019 · *2019 IEEE Second International Conference on Artificial Intelligence and Knowledge Engineering (AIKE), 3-5 June 2019, Cagliari, Sardinia, Italy : proceedings.*
WordNet2Vec: corpora agnostic word vectorization method
2019 · *Neurocomputing*
Avaya Conversational Intelligence: a real-time system for Spoken Language Understanding in human-human call center conversations
2019 · *Interspeech 2019 : 20th Annual Conference of the International Speech Communication Association, Graz, Austria, September 15th-19th 2019.*
Extracting aspects hierarchies using Rhetorical Structure Theory
2018 · *ACAI 2018 : Proceedings of the 2018 International Conference on Algorithms, Computing and Artificial Intelligence, Sanya, China, December 21-23, 2018.*
Method for aspect-based sentiment annotation using rhetorical analysis
2017 · *Intelligent Information and Database Systems : 9th Asian Conference, ACIIDS 2017, Kanazawa, Japan, April 3-5, 2017 : proceedings. Pt. 1*
Three is more interesting than two :words against publishing methods for sentiment analysis tested only for two classes
2016 · *14th Students' Science Conference : management and algorithms, 22-25 September, 2016.*
Zbadanie własności algorytmów rekomendacji biznesowych uwzględniających dodatkowe informacje o klientach i ich relacjach ze sprzedawcą w wirtualnej sieci sprzedaży
2016
Comprehensive study on lexicon-based ensemble classification sentiment analysis
2016 · *Entropy*
Fast and accurate - improving lexicon-based sentiment classification with an ensemble methods
2016 · *Intelligent Information and Database Systems : 8th Asian Conference, ACIIDS 2016, Da Nang, Vietnam, March 14-16, 2016 : proceedings. Pt. 2*
Sentiment analysis for Polish using transfer learning approach
2015 · *The Second European Network Intelligence Conference, ENIC 2015 : 21-22 September 2015, Karlskrona, Sweden : proceedings.*
Simpler is better? Lexicon-based ensemble sentiment classification beats supervised methods
2014 · *Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2014 : Beijing, China, 17-20 August 2014*
Belief propagation method for word sentiment in WordNet 3.0
2014 · *Intelligent Information and Database Systems : 6th Asian Conference, ACIIDS 2014, Bangkok, Thailand, April 7-9, 2014 : proceedings. Pt. 2*
An approach to sentiment analysis of movie reviews: lexicon based vs. classification
2014 · *Hybrid artificial intelligence systems : 9th international conference, HAIS 2014, Salamanca, Spain, June 11-13, 2014 : proceedings*
