Skip to content

Can you include code for this to handle GPT-Neo-125M #308

@thistleknot

Description

@thistleknot

I was considering diving into the source to do it, but I don't see any clear indications of huggingface models. I assume you must be downloading the weights and then applying them to an empty pytorch model.

Activity

thistleknot

thistleknot commented on Jan 15, 2023

@thistleknot
Author

I'm actually trying to borrow a prompt engineering method used for simple-gpt2
https://github.com/RossSong/GPT2-Question-Answering/blob/master/QA.ipynb
for retooling with happytransformer.

EricFillion/happy-transformer#310

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

      Development

      No branches or pull requests

        Participants

        @thistleknot

        Issue actions

          Can you include code for this to handle GPT-Neo-125M · Issue #308 · minimaxir/gpt-2-simple