TD-DeltaPi: A Model-Free Algorithm for Efficient Exploration.

Andrew G. Barto , Bruno Castro da Silva
national conference on artificial intelligence

2012
Autonomous Open-Ended Learning of Interdependent Tasks

Vieri Giuliano Santucci , Bruno Castro da Silva , Gianluca Baldassarre , Emilio Cartoni
arXiv: Artificial Intelligence

2
2019
Parameterized Melody Generation with Autoencoders and Temporally-Consistent Noise

Jim Tørresen , Bruno Castro da Silva , Aline Weber , Lucas N. Alegre
new interfaces for musical expression 174 -179

2019
Reinforcement Learning based Control of Traffic Lights in Non-stationary Environments: A Case Study in a Microscopic Simulator.

Denise de Oliveira , Ana LC Bazzan , Bruno Castro da Silva , Eduardo W Basso
european workshop on multi-agent systems

18
2006
Autonomous learning of multiple, context-dependent tasks.

Vieri Giuliano Santucci , Bruno Castro da Silva , Gianluca Baldassarre , Davide Montella
arXiv: Robotics

2
2020
Learning System-Efficient Equilibria in Route Choice Using Tolls

Ana Bazzan , Gabriel De Oliveira Ramos , Bruno Castro da Silva , Roxana Radulescu
adaptive and learning agents 1 -9

2
2018
Universal Off-Policy Evaluation.

Erik G. Learned-Miller , Emma Brunskill , Scott Niekum , Bruno Castro da Silva
arXiv: Learning

31
2021
Learning parameterized motor skills on a humanoid robot

Bruno Castro da Silva , Gianluca Baldassarre , George Konidaris , Andrew Barto
international conference on robotics and automation 5239 -5244

34
2014
A task-and-technique centered survey on visual analytics for deep learning model engineering

Rafael Garcia , Alexandru C Telea , Bruno Castro da Silva , Jim Tørresen
Computers & Graphics 77 30 -49

43
2018
Preventing undesirable behavior of intelligent machines.

Philip S. Thomas , Bruno Castro da Silva , Andrew G. Barto , Stephen Giguere
Science 366 ( 6468) 999 -1004

40
2019
Fairness guarantees under demographic shift

Stephen Giguere , Blossom Metevier , Yuriy Brun , Bruno Castro da Silva
Smpte Journal

10
2022
Model-Based Reinforcement Learning with SINDy

Rushiv Arora , Bruno Castro da Silva , Eliot Moss
arXiv preprint arXiv:2208.14501

2022
On ensuring that intelligent machines are well-behaved

Philip S Thomas , Bruno Castro da Silva , Andrew G Barto , Emma Brunskill
arXiv preprint arXiv:1708.05448

18
2017
Mosaic: uma ferramenta de análise espacial para o ambiente Cityzoom

Carlos Eduardo Scheidegger , Bruno Castro da Silva , Pablo Colossi Grazziotin
Salão de Iniciação Científica (13.: 2001: Porto Alegre). Livro de resumos. Porto Alegre: UFRGS, 2001.

3
2001
O modulor como sistema de design

Patrícia G Neuhaus , Bruno Castro da Silva , Carlos Eduardo Scheidegger , Rosirene Mayer
Salão de Iniciação Científica (13.: 2001: Porto Alegre). Livro de resumos. Porto Alegre: UFRGS, 2001.

2001
Accelerating multi-agent reinforcement learning with dynamic co-learning

Daniel Garant , Bruno Castro da Silva , Victor Lesser , Chongjie Zhang
Technical report, Tech. Rep.

15
2015
Biasing the behavior of organizationally adept agents

Daniel D Corkill , Chongjie Zhang , Bruno Castro da Silva , Yoonheui Kim
AAMAS 1309 -1310

2
2013