Search results
Your search for "*" yielded 541442 hits
Exciting action : investigating efficient exploration for learning musculoskeletal humanoid locomotion
Learning a locomotion controller for a musculoskeletal system is challenging due to over-actuation and high-dimensional action space. While many reinforcement learning methods attempt to address this issue, they often struggle to learn human-like gaits because of the complexity involved in engineering an effective reward function. In this paper, we demonstrate that adversarial imitation learning c
Fast kinodynamic planning on the constraint manifold with deep neural networks
Motion planning is a mature area of research in robotics with many well-established methods based on optimization or sampling the state space, suitable for solving kinematic motion planning. However, when dynamic motions under constraints are needed and computation time is limited, fast kinodynamic planning on the constraint manifold is indispensable. In recent years, learning-based solutions have
En vegetarisk lunch på Hartekamp : – och andra notiser om Linné ur en svensk sjöofficers dagbok
A review of every occasion on which Carl von Linné is mentioned in Carl Tersmeden's memoirs, with commentary, including on chronological discrepancies.
A retrospective on the robot air hockey challenge : benchmarking robust, reliable, and safe learning techniques for real-world robotics
Machine learning methods have a groundbreaking impact in many application domains, but their application on real robotic platforms is still limited. Despite the many challenges associated with combining machine learning technology with robotics, robot learning remains one of the most promising directions for enhancing the capabilities of robots. When deploying learning-based approaches on real rob
LS-IQ : implicit reward regularization for inverse reinforcement learning
Recent methods for imitation learning directly learn a Q-function using an implicit reward formulation rather than an explicit reward function. However, these methods generally require implicit reward regularization to improve stability and often mistreat absorbing states. Previous works show that a squared norm regularization on the implicit reward function is effective, but do not provide a theore
Robust localization, mapping, and navigation for quadruped robots
Quadruped robots are currently a widespread platform for robotics research, thanks to powerful Reinforcement Learning controllers and the availability of cheap and robust commercial platforms. However, to broaden the adoption of the technology in the real world, we require robust navigation stacks relying only on low-cost sensors such as depth cameras. This paper presents a first step towards a ro
Dimensionality reduction and prioritized exploration for policy search
Black-box policy optimization is a class of reinforcement learning algorithms that explores and updates the policies at the parameter level. This class of algorithms is widely applied in robotics with movement primitives or non-differentiable policies. Furthermore, these approaches are particularly relevant where exploration at the action level could cause actuator damage or other safety issues. H
Regularized deep signed distance fields for reactive motion generation
Autonomous robots should operate in real-world dynamic environments and collaborate with humans in tight spaces. A key component for allowing robots to leave structured lab and manufacturing settings is their ability to evaluate online and real-time collisions with the world around them. Distance-based constraints are fundamental for enabling robots to plan their actions and act safely, protecting
Learning stable vector fields on Lie Groups
Learning robot motions from demonstration requires models able to specify vector fields for the full robot pose when the task is defined in operational space. Recent advances in reactive motion generation have shown that learning adaptive, reactive, smooth, and stable vector fields is possible. However, these approaches define vector fields on a flat Euclidean manifold, while representing vector f
Long-term visitation value for deep exploration in sparse-reward reinforcement learning
Reinforcement learning with sparse rewards is still an open challenge. Classic methods rely on getting feedback via extrinsic rewards to train the agent, and in situations where this occurs very rarely the agent learns slowly or cannot learn at all. Similarly, if the agent also receives rewards that create suboptimal modes of the objective function, it will likely prematurely stop exploring. More
Continuous action reinforcement learning from a mixture of interpretable experts
Reinforcement learning (RL) has demonstrated its ability to solve high dimensional tasks by leveraging non-linear function approximators. However, these successes are mostly achieved by 'black-box' policies in simulated domains. When deploying RL to the real world, several concerns regarding the use of a 'black-box' policy might be raised. In order to make the learned policies more transparent, we
Accountability for Human Rights Atrocities in Transitional Societies
Remissyttrande: Omarbetat direktiv om luftkvalitet och renare luft i Europa – författningsförslag med anledning av bestämmelserna i det nya luftkvalitetsdirektivet om tillgång till rättslig prövning och rätt till skadestånd samt Naturvårdsverkets delredovisning av regeringsuppdrag med förslag till genomförande av det nya luftkvalitetsdirektivet
Riskanalys av vätgas- och vätgasfabriken vid Ringhals
The report is a risk analysis of the hydrogen production plant at Ringhals Nuclear Power Plant, Sweden. The report also discusses potential hazards from the handling and storage of hydrogen gas. Principles of hydrogen gas detection and sensor technology are presented. A HAZOP analysis of the plant is carried out, and recommendations for improvements based on the results of the analysis are made. Co
Släckmedel och släcksystem alternativ för nutida och framtida stridsfartyg.
An inventory and analysis of extinguishing agents and systems. A quantitative analysis of the weight of the systems and of the damage to hull material for different fire scenarios. Tests have been carried out in a cone calorimeter and simulations in HSLAB and FREIA.
Riskanalys tågtunnel
A probabilistic risk analysis of a railway tunnel in Örnsköldsvik, Sweden, using societal risk to show the hazard of the system. The analysis indicates a high risk, greater than the recommended levels of acceptance according to international risk criteria. Furthermore, proposals for hazard prevention in tunnels are made.
Riskanalys - Storskalig kemikaliehantering
Riskbaserad brandteknisk analys av diskotekslokaler
The purpose of this study is to evaluate the safety of people located in discotheques in case of fire. A quantitative fire risk analysis (QRA) for three different types of discotheques is carried out. The analysis is based on performance-based codes from which fire protection alternatives are analysed. The results of the quantitative fire risk analysis are described as risk profiles and average risk
Fullskaleförsök av brand i ett rum med boendesprinkler
A series of ten full-scale fire scenarios has been analysed in a compartment with and without a residential sprinkler. The production of carbon monoxide, carbon dioxide and oxygen was measured at two different heights in the compartment during the tests. (Swedish)
