This wouldn't make any sense for an actual screenplay, but it would be really cool and potentially useful when, say, using it to format D&D session logs, since all the characters already have tokens there anyway.
Probably just add a new parser function to set the images/targets, plus an RL module to turn those into styles that can be applied to the speaker lines themselves.
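
To make the idea a bit more concrete, here is a rough sketch of the "turn it into styles" half, written in Python purely for illustration. Everything in it is an assumption: the `speaker_class` and `build_styles` names, the `speaker-…` class scheme, the `::before` approach, and the example token path are all made up, and the real thing would live in whatever the parser function and module end up looking like.

```python
import re


def speaker_class(name: str) -> str:
    """Turn a speaker name into a CSS-safe class name (hypothetical scheme)."""
    slug = re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")
    return f"speaker-{slug}"


def build_styles(tokens: dict[str, str]) -> str:
    """Emit one CSS rule per speaker, attaching its token image to the line."""
    rules = []
    for name, image_url in tokens.items():
        rules.append(
            f".{speaker_class(name)}::before {{ "
            f'background-image: url("{image_url}"); }}'
        )
    return "\n".join(rules)


# Hypothetical mapping, as the parser function might collect it.
tokens = {"Aria the Bold": "tokens/aria.png"}
print(build_styles(tokens))
```

The speaker lines would then only need the matching class, so the token mapping stays in one place and the formatting of the lines themselves doesn't change.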