close
Skip to main content

NLP Collective

A collective focused on NLP (natural language processing), the transformation or extraction of useful information from natural language data.
38.3k Questions
+3
13.3k Members
+48
Contact

Pinned content

View all 2 collections

NLP admins have deemed these posts noteworthy.

Pinned
9 votes
2k views
Collection

Natural Language Processing FAQ

Frequently asked questions relating to NLP. Many of these may be questions that are often asked over and over, duplicates would likely be closed in favor of these. Add the best answer (using the ...
Berthold's user avatar
  • 101

Can you answer these questions?

View all unanswered questions

These questions still don't have an answer

Advice
0 votes
0 replies
37 views

LDA Topic Modeling: Balancing Coherence Score (C_v) vs. Discrepant Downstream Predictor Importances

I am a novice in topic modeling, and I would appreciate feedback and opinions from experts in the field. I am currently stuck on the concept of evaluating and finalizing my results. I am working on an ...
-2 votes
0 answers
92 views

UNEXPECTED: can be ignored when loading from different task/architecture; not ok if you expect identical arch

Given the following code snippet: from transformers import AutoTokenizer, AutoModelForSequenceClassification import torch device = torch.device("cuda" if torch.cuda.is_available() else "...
Tooling
0 votes
0 replies
66 views

Can I do a topic model for pre processing, find the topics I want and then do a second one?

Looking to do word disambiguition on a dataset with polyssemy. I already tried KB and supervised but they don't capture all the meanings, so I thought to do a first cluster, and then preprocess t the ...
0 votes
0 answers
98 views

In Kaggle, when executing pipeline(...) from transformers, I get 'RuntimeError: Cannot send a request, as the client has been closed.'

I am trying to run the following code in a Kaggle notebook, from transformers import pipeline pipeline = pipeline(task="text-generation", model="Qwen/Qwen2.5-1.5B") pipeline("...
0 votes
0 answers
59 views

How to handle I/O Memory issues with HuggingFace in Kaggle (SafeTensorError)?

I got the above error when I was trying to get a model from HuggingFace. I was using the AutoModelForCasualLM to get the model in question (this is a method directly from the transformers package). ...