A proper way to raycast from screen to 3D world space

Godot Version

Godot 4.2.2.stable

Question

Hi folks!

I’m developing a simple top-down 3D shooter and I have questions about converting screen coordinates to 3D world coordinates using raycasts.

To make the main character rotate around its axis and aim in a specific direction, the way I’m doing is to capture the mouse position on the screen and project the 2D coordinates to the 3D coordinates using raycast from the Camera3D. However, the way it is done in Godot seems to me a little bit laborious.

After reading the official documentation and watching some videos related to the topic, I came with this solution:

func _unhandled_input(event: InputEvent) -> void:
	if event is InputEventMouseMotion:
		const RAY_LENGTH = 5000.0
		
		var camera = get_viewport().get_camera_3d()
		var ray_origin = camera.project_ray_origin(event.position)
		var ray_end = ray_origin + camera.project_ray_normal(event.position) * RAY_LENGTH
		
		var space_state = get_world_3d().direct_space_state
		var intersection = space_state.intersect_ray(PhysicsRayQueryParameters3D.create(ray_origin, ray_end))
		if not intersection.is_empty():
			aim_at(intersection.position)

Comparing to Unity, where we can do something like this:

RaycastHit hit;
Ray ray = camera.ScreenPointToRay(Input.mousePosition);
if (Physics.Raycast(ray, out hit)) {
	// Do something
}

in Godot, I had to do all that stuff to perform the same type of operation.

So my question(s) is/are:

  • In Godot, is this the recommended/official way of doing this type of raycast?
  • Should I always specify a length for the raycast or is it possible to do with an infinite length?
  • Is there a more practical/simplified way of doing this type of operation?

Thanks in advance! :slight_smile:

Actually the best way in my oppinion is with CollisionObject3D — Godot Engine (stable) documentation in English with the signal input_event.

You pretty much do not need to do any transforms, just use the position argument or make a ray cast at all.

EDIT:
But, actually, I think what you could do is to not simply use any raycast at all, or any colliders, you could just transform your character into screen coordinates, detect your mouse click, then detect & set the angle between your screen character coordinates and mouse.

I guess this is the way to do it in Godot.

Does it have to be a raycast?

If not, the camera 3d has some methods built in that can help with projections.
Maybe this can work too (Did only a quick test, so no guarantie):

extends Camera3D


func _process(delta: float) -> void:
	var mouse_pos = get_viewport().get_mouse_position()
	var world_pos = project_position(mouse_pos, position.z)
	print(world_pos)
1 Like

The first thing about Godot raycast that might not make any sense to anyone who base their ideas/concept/application from Unity is the difference in how both manage the order and access to the physics data.

Unity works with stacks which allows data to be considered in the calculation of the physics engine even when outside the Update() phase during which physics are applied. As such, if there’s a change coded outside of the Update() function that affects the physics of an object, that change will be considered by raycast and other kind of physics related calculation if done after said changes, but also before the next physics loop.

Godot doesn’t work with stacks on object/nodes, but with a snapshot of the scene state which is only updated during the _PhysicsProcess(). (I’ll immediately point out that I’m not considering/writing about how multi-threading the physics onto a separate thread and how it would change things with Godot.) Due to that, whenever you cast a raycast, you’re working with the latest updated state which might, depending on where you call it, be obsolete already.

What you might not know about Unity’s camera.ScreenPointToRay(); function is that it’s does something a really similar thing to what you do in Godot with only the caveat of having to update the 3D space state separately. If you look at the source code of what ScreenPointToRay() does behind, you would get that it’s not just as simple as it looks. As such, you could do an extremely similar function/tool in Godot by creating your own Ray class.

For example, this is what I would write in the script (in C#) that would require the raycast:

    [Export] public Node3D DebugAimSphere;
    private Vector3 AimLocation;
    private Ray ray;
    private Dictionary CamRaycastResult;

    [Export] private float CamRayDistance = 1000f;

    public override void _Ready()
    {
        base._Ready();
        ray = new Ray();
    }

    public override void _Input(InputEvent _ie)
    {
        base._Input(_ie);
        CamRaycastResult = ray.RaycastScreenPointToRay(GetViewport().GetMousePosition(), GetViewport().GetCamera3D(), CamRayDistance);
        if (CamRaycastResult.Count > 0)
        {
            if (DebugAimSphere != null)
            {
                AimLocation = (Vector3)CamRaycastResult["position"];
                DebugAimSphere.GlobalPosition = AimLocation;
            }
        }
    }

and I would have this as a Ray.cs file in my project:

using Godot;
using Godot.Collections;

public partial class Ray
{
    private Vector3 Raycast_Start = new Vector3();
    private Vector3 Raycast_End = new Vector3();
    private PhysicsRayQueryParameters3D CamRaycastQuery;

    public Dictionary RaycastScreenPointToRay(Vector2 mousePosition, Camera3D curCam, float rayDistance)
    {
        if (WorldManager.Instance == null || WorldManager.Instance.space3D == null)
        {
            GD.Print("Space 3D is not set yet.");
            return new Dictionary();
        }
        Raycast_Start = curCam.ProjectRayOrigin(mousePosition);
        Raycast_End = Raycast_Start + (curCam.ProjectRayNormal(mousePosition) * rayDistance);
        return Raycast();
    }

    private Dictionary Raycast()
    {
        CamRaycastQuery = PhysicsRayQueryParameters3D.Create(Raycast_Start, Raycast_End, Constants.Ref_Mask_PlayerInteraction);
        CamRaycastQuery.CollideWithBodies = true;
        return WorldManager.Instance.space3D.IntersectRay(CamRaycastQuery);
    }
}

The reason why I separate the PhysicsRayQueryParameters3D into a separate function in the Ray class is that I could add more functions later that wouldn’t be based on the camera, but maybe just 2 different locations/nodes/objects positions.

As I previously mentioned, only the PhysicsDirectSpaceState3D is kinda “unique” to Godot. In the code above, I set it up so that the world state is generated only once on a parent Node3D that exists at all time and this has the following script:

using Godot;

public partial class WorldManager : Node3D
{
    private static WorldManager _instance;

    public static WorldManager Instance => _instance;

    public PhysicsDirectSpaceState3D space3D { get; private set; }

    public override void _EnterTree()
    {
        if (_instance != null && _instance != this)
        {
            _instance.QueueFree(); // A singleton of the World Manager already exist. Delete it.
        }
        _instance = this;
    }

    public override void _PhysicsProcess(double delta)
    {
        base._PhysicsProcess(delta);
        space3D = GetWorld3D().DirectSpaceState;
    }
}

As I previously mentioned, Godot uses a snapshot of the current scene’s content (state) for its physics. that space3D of type PhysicsDirectSpaceState3D is basically a copy of that snapshot. Whenever you do raycast, you basically ask the physic engine to look at that stored state and check if there’s a contact or not.