Deep policy gradient methods without batch updates, target networks, or replay buffers

From National Research Council Canada

Download
  1. (PDF, 591.4 MiB)
Linkhttps://proceedings.neurips.cc/paper_files/paper/2024/file/019ef89617d539b15ed610ce8d1b76e1-Paper-Conference.pdf
AuthorSearch for: ; Search for: ; Search for: ; Search for: ; Search for: ; Search for: 1ORCID identifier: https://orcid.org/0000-0002-3567-7834; Search for: ; Search for:
Affiliation
  1. National Research Council Canada. Digital Technologies
FormatText, Article
Conference38th Conference on Neural Information Processing Systems, NeurIPS 2024, December 10-15, 2024, Vancouver, BC, Canada
Abstract
Publication date
PublisherNeural Information Processing Systems Foundation
Licence
In
Series
Related data
Other format
LanguageEnglish
Peer reviewedYes
Export citationExport as RIS
Report a correctionReport a correction (opens in a new tab)
Record identifier14988d94-5936-492f-bcd1-18cdd3161579
Record created2025-04-10
Record modified2025-04-14
Date modified: