MLSP-25: Reinforcement Learning 1 |
| Session Type: Poster |
| Time: Thursday, 10 June, 13:00 - 13:45 |
| Location: Gather.Town |
| Session Chair: Chang Yoo, Korea Advanced Institute of Science and Technology |
| MLSP-25.1: COOPERATIVE SCENARIOS FOR MULTI-AGENT REINFORCEMENT LEARNING IN WIRELESS EDGE CACHING |
| Navneet Garg; University of Edinburgh |
| Tharmalingam Ratnarajah; University of Edinburgh |
| MLSP-25.2: ROBUST DEEP REINFORCEMENT LEARNING FOR UNDERWATER NAVIGATION WITH UNKNOWN DISTURBANCES |
| Juan Parras; Universidad Politécnica de Madrid |
| Santiago Zazo; Universidad Politécnica de Madrid |
| MLSP-25.3: ONLINE HYPER-PARAMETER TUNING FOR THE CONTEXTUAL BANDIT |
| Djallel Bouneffouf; IBM Research |
| Emmanuelle Claeys; Strasbourg University |
| MLSP-25.4: DOUBLE-LINEAR THOMPSON SAMPLING FOR CONTEXT-ATTENTIVE BANDITS |
| Djallel Bouneffouf; IBM Research |
| Raphael Feraud; Orange |
| Sohini Upadhyay; IBM Research |
| Yasaman Khazaeni; Université de Montréal |
| Irina Rish; Université de Montréal |
| MLSP-25.5: ON THE MARGINAL BENEFIT OF ACTIVE LEARNING: DOES SELF-SUPERVISION EAT ITS CAKE? |
| Yao-Chun Chan; University of California, Riverside |
| Mingchen Li; University of California, Riverside |
| Samet Oymak; University of California, Riverside |
| MLSP-25.6: ROBUST MAML: PRIORITIZATION TASK BUFFER WITH ADAPTIVE LEARNING PROCESS FOR MODEL-AGNOSTIC META-LEARNING |
| Thanh Nguyen; Korea Advanced Institute of Science and Technology (KAIST) |
| Tung Luu; Korea Advanced Institute of Science and Technology (KAIST) |
| Trung Pham; Korea Advanced Institute of Science and Technology (KAIST) |
| Sanzhar Rakhimkul; Korea Advanced Institute of Science and Technology (KAIST) |
| Chang Dong Yoo; Korea Advanced Institute of Science and Technology (KAIST) |