Interview with Kate Candon: Leveraging express and implicit suggestions in human-robot interactions


On this interview sequence, we’re assembly a number of the AAAI/SIGAI Doctoral Consortium members to search out out extra about their analysis. Kate Candon is a PhD scholar at Yale College excited about understanding how we are able to create interactive brokers which are extra successfully capable of assist folks. We spoke to Kate to search out out extra about how she is leveraging express and implicit suggestions in human-robot interactions.

Might you begin by giving us a fast introduction to the subject of your analysis?

I examine human-robot interplay. Particularly I’m excited about how we are able to get robots to higher study from people in the way in which that they naturally educate. Sometimes, quite a lot of work in robotic studying is with a human instructor who is simply tasked with giving express suggestions to the robotic, however they’re not essentially engaged within the activity. So, for instance, you may need a button for “good job” and “unhealthy job”. However we all know that people give quite a lot of different alerts, issues like facial expressions and reactions to what the robotic’s doing, perhaps gestures like scratching their head. It might even be one thing like shifting an object to the facet {that a} robotic arms them – that’s implicitly saying that that was the improper factor at hand them at the moment, as a result of they’re not utilizing it proper now. These implicit cues are trickier, they want interpretation. Nevertheless, they’re a method to get extra info with out including any burden to the human person. Up to now, I’ve checked out these two streams (implicit and express suggestions) individually, however my present and future analysis is about combining them collectively. Proper now, we’ve a framework, which we’re engaged on bettering, the place we are able to mix the implicit and express suggestions.

When it comes to choosing up on the implicit suggestions, how are you doing that, what’s the mechanism? As a result of it sounds extremely troublesome.

It may be actually arduous to interpret implicit cues. Folks will reply otherwise, from individual to individual, tradition to tradition, and so forth. And so it’s arduous to know precisely which facial response means good versus which facial response means unhealthy.

So proper now, the primary model of our framework is simply utilizing human actions. Seeing what the human is doing within the activity may give clues about what the robotic ought to do. They’ve completely different motion areas, however we are able to discover an abstraction in order that we are able to know that if a human does an motion, what the same actions can be that the robotic can do. That’s the implicit suggestions proper now. After which, this summer season, we need to lengthen that to utilizing visible cues and facial reactions and gestures.

So what sort of situations have you ever been type of testing it on?

For our present undertaking, we use a pizza making setup. Personally I actually like cooking for example as a result of it’s a setting the place it’s straightforward to think about why these items would matter. I additionally like that cooking has this factor of recipes and there’s a system, however there’s additionally room for private preferences. For instance, anyone likes to place their cheese on prime of the pizza, so it will get actually crispy, whereas different folks prefer to put it underneath the meat and veggies, in order that perhaps it’s extra melty as an alternative of crispy. And even, some folks clear up as they go versus others who wait till the tip to take care of all of the dishes. One other factor that I’m actually enthusiastic about is that cooking might be social. Proper now, we’re simply working in dyadic human-robot interactions the place it’s one individual and one robotic, however one other extension that we need to work on within the coming yr is extending this to group interactions. So if we’ve a number of folks, perhaps the robotic can study not solely from the individual reacting to the robotic, but additionally study from an individual reacting to a different individual and extrapolating what that may imply for them within the collaboration.

Might you say a bit about how the work that you just did earlier in your PhD has led you up to now?

After I first began my PhD, I used to be actually excited about implicit suggestions. And I believed that I wished to deal with studying solely from implicit suggestions. One among my present lab mates was centered on the EMPATHIC framework, and was trying into studying from implicit human suggestions, and I actually appreciated that work and thought it was the path that I wished to enter.

Nevertheless, that first summer season of my PhD it was throughout COVID and so we couldn’t actually have folks come into the lab to work together with robots. And so as an alternative I did an internet examine the place I had folks play a sport with a robotic. We recorded their face whereas they have been taking part in the sport, after which we tried to see if we might predict primarily based on simply facial reactions, gaze, and head orientation if we might predict what behaviors they most popular for the agent that they have been taking part in with within the sport. We really discovered that we might decently properly predict which of the behaviors they most popular.

The factor that was actually cool was we discovered how a lot context issues. And I feel that is one thing that’s actually vital for going from only a solely teacher-learner paradigm to a collaboration – context actually issues. What we discovered is that generally folks would have actually huge reactions however it wasn’t essentially to what the agent was doing, it was to one thing that that they had finished within the sport. For instance, there’s this clip that I at all times use in talks about this. This individual’s taking part in and he or she has this actually noticeably confused, upset look. And so at first you may assume that’s unfavorable suggestions, regardless of the robotic did, the robotic shouldn’t have finished that. However in the event you really take a look at the context, we see that it was the primary time that she misplaced a life on this sport. For the sport we made a multiplayer model of Area Invaders, and he or she bought hit by one of many aliens and her spaceship disappeared. And so primarily based on the context, when a human seems at that, we really say she was simply confused about what occurred to her. We need to filter that out and never really contemplate that when reasoning concerning the human’s conduct. I feel that was actually thrilling. After that, we realized that utilizing implicit suggestions solely was simply so arduous. That’s why I’ve taken this pivot, and now I’m extra excited about combining the implicit and express suggestions collectively.

You talked about the express factor can be extra binary, like good suggestions, unhealthy suggestions. Would the person-in-the-loop press a button or would the suggestions be given by speech?

Proper now we simply have a button for good job, unhealthy job. In an HRI paper we checked out express suggestions solely. We had the identical area invaders sport, however we had folks come into the lab and we had a bit of Nao robotic, a bit of humanoid robotic, sitting on the desk subsequent to them taking part in the sport. We made it in order that the individual might give constructive or unfavorable suggestions through the sport to the robotic in order that it might hopefully study higher serving to conduct within the collaboration. However we discovered that individuals wouldn’t really give that a lot suggestions as a result of they have been centered on simply attempting to play the sport.

And so on this work we checked out whether or not there are other ways we are able to remind the individual to present suggestions. You don’t need to be doing it on a regular basis as a result of it’ll annoy the individual and perhaps make them worse on the sport in the event you’re distracting them. And in addition you don’t essentially at all times need suggestions, you simply need it at helpful factors. The 2 circumstances we checked out have been: 1) ought to the robotic remind somebody to present suggestions earlier than or after they struggle a brand new conduct? 2) ought to they use an “I” versus “we” framing? For instance, “keep in mind to present suggestions so I generally is a higher teammate” versus “keep in mind to present suggestions so we generally is a higher staff”, issues like that. And we discovered that the “we” framing didn’t really make folks give extra suggestions, however it made them really feel higher concerning the suggestions they gave. They felt prefer it was extra useful, type of a camaraderie constructing. And that was solely express suggestions, however we need to see now if we mix that with a response from somebody, perhaps that time can be a very good time to ask for that express suggestions.

You’ve already touched on this however might you inform us concerning the future steps you have got deliberate for the undertaking?

The massive factor motivating quite a lot of my work is that I need to make it simpler for robots to adapt to people with these subjective preferences. I feel when it comes to goal issues, like with the ability to decide one thing up and transfer it from right here to right here, we’ll get to some extent the place robots are fairly good. But it surely’s these subjective preferences which are thrilling. For instance, I like to cook dinner, and so I need the robotic to not do an excessive amount of, simply to perhaps do my dishes while I’m cooking. However somebody who hates to cook dinner may need the robotic to do all the cooking. These are issues that, even when you have the proper robotic, it could actually’t essentially know these issues. And so it has to have the ability to adapt. And quite a lot of the present desire studying work is so information hungry that you must work together with it tons and tons of instances for it to have the ability to study. And I simply don’t assume that that’s real looking for folks to really have a robotic within the house. If after three days you’re nonetheless telling it “no, if you assist me clear up the lounge, the blankets go on the sofa not the chair” or one thing, you’re going to cease utilizing the robotic. I’m hoping that this mixture of express and implicit suggestions will assist or not it’s extra naturalistic. You don’t should essentially know precisely the proper method to give express suggestions to get the robotic to do what you need it to do. Hopefully by all of those completely different alerts, the robotic will be capable to hone in a bit of bit sooner.

I feel an enormous future step (that’s not essentially within the close to future) is incorporating language. It’s very thrilling with how giant language fashions have gotten so significantly better, but additionally there’s quite a lot of fascinating questions. Up till now, I haven’t actually included pure language. A part of it’s as a result of I’m not totally certain the place it matches within the implicit versus express delineation. On the one hand, you may say “good job robotic”, however the way in which you say it could actually imply various things – the tone is essential. For instance, in the event you say it with a sarcastic tone, it doesn’t essentially imply that the robotic really did a very good job. So, language doesn’t match neatly into one of many buckets, and I’m excited about future work to assume extra about that. I feel it’s an excellent wealthy area, and it’s a means for people to be far more granular and particular of their suggestions in a pure means.

What was it that impressed you to enter this space then?

Actually, it was a bit of unintentional. I studied math and pc science in undergrad. After that, I labored in consulting for a few years after which within the public healthcare sector, for the Massachusetts Medicaid workplace. I made a decision I wished to return to academia and to get into AI. On the time, I wished to mix AI with healthcare, so I used to be initially enthusiastic about scientific machine studying. I’m at Yale, and there was just one individual on the time doing that, so I used to be the remainder of the division after which I discovered Scaz (Brian Scassellati) who does quite a lot of work with robots for folks with autism and is now shifting extra into robots for folks with behavioral well being challenges, issues like dementia or anxiousness. I believed his work was tremendous fascinating. I didn’t even notice that that type of work was an possibility. He was working with Marynel Vázquez, a professor at Yale who was additionally doing human-robot interplay. She didn’t have any healthcare initiatives, however I interviewed together with her and the questions that she was enthusiastic about have been precisely what I wished to work on. I additionally actually wished to work together with her. So, I unintentionally stumbled into it, however I really feel very grateful as a result of I feel it’s a means higher match for me than the scientific machine studying would have essentially been. It combines quite a lot of what I’m excited about, and I additionally really feel it permits me to flex backwards and forwards between the mathy, extra technical work, however then there’s additionally the human factor, which can be tremendous fascinating and thrilling to me.

Have you ever bought any recommendation you’d give to somebody pondering of doing a PhD within the discipline? Your perspective will probably be significantly fascinating since you’ve labored exterior of academia after which come again to start out your PhD.

One factor is that, I imply it’s type of cliche, however it’s not too late to start out. I used to be hesitant as a result of I’d been out of the sector for some time, however I feel if you will discover the proper mentor, it may be a very good expertise. I feel the largest factor is discovering a very good advisor who you assume is engaged on fascinating questions, but additionally somebody that you just need to study from. I really feel very fortunate with Marynel, she’s been a superb advisor. I’ve labored fairly intently with Scaz as properly they usually each foster this pleasure concerning the work, but additionally care about me as an individual. I’m not only a cog within the analysis machine.

The opposite factor I’d say is to discover a lab the place you have got flexibility in case your pursuits change, as a result of it’s a very long time to be engaged on a set of initiatives.

For our remaining query, have you ever bought an fascinating non-AI associated truth about you?

My essential summertime pastime is taking part in golf. My complete household is into it – for my grandma’s a hundredth celebration we had a household golf outing the place we had about 40 of us {golfing}. And truly, that summer season, when my grandma was 99, she had a par on one of many par threes – she’s my {golfing} function mannequin!

About Kate

Kate Candon is a PhD candidate at Yale College within the Pc Science Division, suggested by Professor Marynel Vázquez. She research human-robot interplay, and is especially excited about enabling robots to higher study from pure human suggestions in order that they will develop into higher collaborators. She was chosen for the AAMAS Doctoral Consortium in 2023 and HRI Pioneers in 2024. Earlier than beginning in human-robot interplay, she acquired her B.S. in Arithmetic with Pc Science from MIT after which labored in consulting and in authorities healthcare.




AIhub
is a non-profit devoted to connecting the AI neighborhood to the general public by offering free, high-quality info in AI.


AIhub
is a non-profit devoted to connecting the AI neighborhood to the general public by offering free, high-quality info in AI.



Lucy Smith
is Managing Editor for AIhub.