= ROS for Human-Robot Interaction =

ROS for Human-Robot Interaction (or ROS4HRI) is an umbrella for all the ROS packages, conventions and tools that help developing interactive robots with ROS.

{{attachment:ros4hri_ids.png||width=500}}


<<TableOfContents(3)>>

== ROS4HRI conventions ==

The [[https://www.ros.org/reps/rep-0155.html | ROS REP-155]] (aka, ''ROS4HRI'') defines a set of topics, naming conventions, frames that are important for HRI application. It was originally introduced in the paper [[https://doi.org/10.1109/IROS51168.2021.9636816 | 'ROS for Human-Robot Interaction']], presented at IROS2021.

The REP-155 is still evolving. On-going changes can be submitted and discussed on the [[https://github.com/ros-infrastructure/rep/pulls?q=is%3Apr+rep-0155+ | ros-infrastructure/rep Github repository]].

== Common ROS packages ==

 * [[hri_msgs]] (available in noetic and humble): base ROS messages for Human-Robot Interaction perception
 * [[https://github.com/ros4hri/hri_actions_msgs|hri_actions_msgs]] (available in noetic and humble): base ROS messages for Human-Robot Interaction actions (including human intents representation)
 * [[human_description]] (available in noetic and humble): a parametric kinematic model of a human, in URDF format
 * [[libhri]] (available in noetic and humble): a C++ library to easily access human-related topics
 * [[pyhri]] (available in noetic and humble): a Python library to easily access human-related topics
 * [[hri_rviz]] (available in noetic, source supports ROS 2): a collection of RViz plugins to visualise faces, facial landmarks, 3D kinematic models...
 * [[https://github.com/ros4hri/rqt_human_radar|rqt_human_radar]] (ROS 1, ROS 2): [[rqt]] plugin, top-down view of the social surroundings of the robot. 


The source code for these packages (as well as several others) can be found on [[https://github.com/ros4hri | github.com/ros4hri]]. 

== Specialized ROS packages ==

''Feel free to add your own packages to this list, as long as they implement the REP-155.''

=== Face detection, recognition, analysis ===

 * [[hri_face_detect]] (ROS1 & ROS2): a Google MediaPipe-based multi-people face detector.
  Supports:
  * facial landmarks
  * 3D head pose estimation
  * 30+ FPS on CPU only

=== Body tracking, gesture recognition ===


{{attachment:frames.png}}

 * [[https://github.com/ros4hri/hri_body_detect|hri_body_detect]] (ROS2): a Google MediaPipe-based 3D full-body pose estimator
  Supports:
  * 2D and 3D pose estimation of a multiple people
  * optionally, can use registered depth information to improve 3D pose estimation 


 * [[hri_fullbody]] (ROS1 & ROS2): a Google MediaPipe-based 3D full-body pose estimator
  Supports:
  * 2D and 3D pose estimation of a single person (multiple person pose estimation possible with an external body detector)
  * facial landmarks
  * optionally, can use registered depth information to improve 3D pose estimation 


=== Voice processing, speech, dialogue understanding ===

=== Whole person analysis ===

 * [[hri_person_manager]] (ROS1 & ROS2): probabilistic fusion of faces, bodies, voices into unified persons.

=== Group interactions, gaze behaviour ===

 * [[https://github.com/ros4hri/hri_engagement | hri_engagement]] (ROS1 & ROS2): real-time engagement assessment, based on visual metrics (gaze direction, proximity).

== Tutorials ==

You can access the [[hri/Tutorials | ROS4HRI tutorials here]].

Even more tutorials on the RO4HRI Github page: https://ros4hri.github.io/ros4hri-tutorials/