Author: David de Hilster
Python Project
With the official launch of the NLPPlus Python package, we are now focusing on the best way to introduce it to the Python community. The major message? NLPPlus is NLP that is 100% customizable. Where other NLP packages and toolkits are turnkey and supposedly do not require any customization, they […]
Read more
Elisha Silas
Elisha K. Silas is a computer scientist specializing in distributed systems and cloud computing, with a strong focus on scalable server-side application development and AI-driven solutions. He holds a bachelor’s degree in information systems and technologyfrom the United States International University Africa and brings four years of professional experience in designing […]
![]()
NLP Workshop for RVCE Coding Club
NLP++ authors David de Hilster and Amnon Meyers will be conducting a virtual workshop on NLP, NLP++, and Compilers from December 18 – 20, 2024. Sign Up For Each Session Here are the three sessions and a link to sign up. YOU MUST SIGN UP FOR EACH SESSION SEPARATELY. The […]
![]()
Building an NLP++ Brazilian Address Cleaner for HPCC SystemsÂ
Live Talk Open to the Public Watch a video of the live presentation of this NLP++ analyzer for Brazilian addresses during the 2024 HPCC Systems Community Summit. It took place on Wednesday, October 8, at 9:00 am EST USA online. It was free to register and attend. Using the NLP++ […]
![]()
Regex for NLP
Regex is ubiquitous in the programming world because of its usefulness as a rule-based text parsing language. Programmers find comfort in the idea of writing explicit, modifiable rules in order to parse text. This is in contrast with black-box statistical models, which cannot be modified when things go wrong – […]
![]()
ACL 2024 in Bangkok Thailand: Revelations of Old and New
I have been in computational linguistics for more than 40 years, and this is the first time I have been to the most important conference in our field: the annual Association of Computational Linguistics (ACL) Conference. As part of the registration process, I became a member for the first time […]
![]()
Guilherme Santos da Silva
Guilherme Santos da Silva has a degree in Computer Engineering from the Federal Technological University of Paraná, Brazil and is currently an employee of LexisNexis Risk Brazil. He discovered HPCC Systems in 2021 when he joined LexisNexis as an intern and participated in the 2021 HPCC Systems Poster Contest with […]
![]()
Scalable Analysis of English Dictionary Files on HPCC Systems Big Data Platform
Congratulations to Jayanth C on presenting his paper on Scalable Analysis of English Dictionary Files on HPCC Systems Big Data Platform at a conference in Japan. Read more about it on LinkedIn. Here are links to the paper online:
![]()
NLP++ and LLM
Trustworthy NLP systems must be rule and knowledge based given all statistical systems like large language models, machine learning, and neural networks are not. With the advent of large language models that can be queried about common knowledge, it is natural to use them to generate linguistic and world knowledge […]
![]()
Online Dictionary Creation Tool
Coming 2024 This is an exciting project that is currently being implemented. Expected first version release data in 2024. Description The dictionary tool is a web-based tool that will allow for quick creation of dictionaries in any language. Given there are few linguistic dictionaries available online for even major languages […]
![]()
Python Package for NLP++
The first version of our NLPPlus python package is ready to use. We are still waiting on approval of the package on the python package website, but it is available as a download from our GitHub. https://github.com/VisualText/py-package-nlpengine The NLPPlus python package for NLP++ allows Python programmers to call NLP++ analyzers […]
![]()
Portuguese Dictionary
The first steps in creating a portuguese dictionary has been started and can be found in the GitHub repository: http://github.com/VisualText/dict-pt-br. This was started by NLP++ co-author David de Hilster given he is fluent in Portuguese and that no digital dictionary for portuguese is available. Video Sessions This is a video […]
![]()
