EleutherAI/gpt-j-6b

Text Generation

TemporalMesh Transformer: 29.4 PPL at 48% compute — dynamic graph attention + adaptive exit gates (open-source, 226 tests)

#51 opened about 1 month ago by

Create app

#48 opened about 1 year ago by

hoomanshirtavani

I know that vocab size and embeddings size are different

#46 opened over 1 year ago by

Upload IMG-20240702-WA0000.jpg

#45 opened over 1 year ago by

Request: DOI

#43 opened over 1 year ago by

gpt-j for thread detection

#41 opened over 2 years ago by

Adding `safetensors` variant of this model

#40 opened over 2 years ago by

[AUTOMATED] Model Memory Requirements

#39 opened over 2 years ago by

model-sizer-bot

Adding Evaluation Results

#38 opened over 2 years ago by

leaderboard-pr-bot

Best Way to Load a Model After Training w/o Requantizing

#37 opened almost 3 years ago by

Update README.md

#36 opened almost 3 years ago by

Error import transformers.models.gptj_modelling-gptj

#35 opened almost 3 years ago by

text to sql

#34 opened almost 3 years ago by

GPTJForCausalLM LM head weights not initialized?

#33 opened almost 3 years ago by

How to get Sentence embeddings?

#32 opened about 3 years ago by deleted

What's the difference between GPT-J and Pythia?

#31 opened about 3 years ago by

Deployment and infrastructure requirement for GPT-J

#29 opened about 3 years ago by

file input format

#27 opened about 3 years ago by

ValueError: Attempting to unscale FP16 gradients.

#26 opened about 3 years ago by

Tokenizer for GPT-J-6B fails when trying to fine-tune for GLUE tasks

#24 opened about 3 years ago by

Tokenizer loading issue

#23 opened about 3 years ago by

try1

#21 opened over 3 years ago by

Is there a float16 version?

#20 opened over 3 years ago by

RuntimeError: expected scalar type Half but found Float

#19 opened over 3 years ago by

Can this model be used for generative question answering?

#18 opened over 3 years ago by

Update config.json

#17 opened over 3 years ago by

How do you download the whole pack of files?

#16 opened over 3 years ago by

How to fine tune or train with our own data?

#15 opened over 3 years ago by

How can we add the ability to remember the conversation?

#14 opened over 3 years ago by

Telegram Info Bot

#13 opened over 3 years ago by

GPTJForCausalLM hogs memory - inference only

#9 opened over 3 years ago by