Skip to main content
The 2026 Annual Developer Survey is live— take the Survey today!

NLP Collective

A collective focused on NLP (natural language processing), the transformation or extraction of useful information from natural language data.
38.3k Questions
+2
13.4k Members
+20
Contact

Pinned content

View all 2 collections

NLP admins have deemed these posts noteworthy.

Pinned
Score of 9
1973 views
Collection

Natural Language Processing FAQ

Frequently asked questions relating to NLP. Many of these may be questions that are often asked over and over, duplicates would likely be closed in favor of these. Add the best answer (using the ...

Can you answer these questions?

View all unanswered questions

These questions still don't have an answer

Advice
0 votes
0 replies
71 views

Does my dual-KG Edge RAG make sense and how to Evaluate? Connecting 2 Knowledge Graphs. BA thesis

I'm working on a KG-RAG system for Labor Law and company HR policies for my BA thesis due in 2 weeks and I just realized some problems with the KG. I have 2 questions: 1 regarding the Edge called ...
Advice
0 votes
0 replies
54 views

Inferencing in GLiNER model

I evaluated multiple GLiNER v2.1 models (small, medium, and large) on CoNLL-2002 and WikiANN English test datasets using the same inference pipeline. Surprisingly, GLiNER Small v2.1 and Medium v2.1 ...
Tooling
0 votes
0 replies
66 views

Best pre-trained vision model for multi-plant disease detection in async web back-end

I'm building a web app with FastAPI + async/await Python backend. Users upload leaf photos via API and the server should return: 1) plant species, 2) disease label or "healthy". Constraints: ...
Advice
0 votes
0 replies
42 views

LDA Topic Modeling: Balancing Coherence Score (C_v) vs. Discrepant Downstream Predictor Importances

I am a novice in topic modeling, and I would appreciate feedback and opinions from experts in the field. I am currently stuck on the concept of evaluating and finalizing my results. I am working on an ...
Tooling
0 votes
0 replies
67 views

Can I do a topic model for pre processing, find the topics I want and then do a second one?

Looking to do word disambiguition on a dataset with polyssemy. I already tried KB and supervised but they don't capture all the meanings, so I thought to do a first cluster, and then preprocess t the ...