Skip to content

gradOutput Ignored #43

@parajain

Description

@parajain

function ReinforceCategorical:updateGradInput(input, gradOutput)

Hi,
I am trying to understand the logic in reinforce implementation. I am new to this so please bear with my basic questions.
Why is gradOutput being ignored? If we multiply gradOutput with rewards, will it be wrong?
Also, what is happening here:self.gradInput:copy(self.output)? Output is a probability distribution right?

Thanks,
Parag

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions