You should be able to follow a link by repeating back its text, either via speech or typing.
To expand upon this design: if only a single link matches, the browser would immediately start loading that page and announce that it's doing so. But if multiple links match, it would list each link's text, title, and href alongside a shorthand letter with which to refer to it in your response.
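As a rough sketch of that flow, the snippet below picks between "load" and "clarify" for a given reply. The `Link` class, the `respond` function, and the matching rule (a case-insensitive substring of the link text) are all assumptions made for illustration; the design above doesn't pin down how matching works.

```python
from dataclasses import dataclass
from string import ascii_lowercase


@dataclass
class Link:
    # Hypothetical record holding the three fields listed during clarification.
    text: str
    title: str
    href: str


def respond(query, links):
    """Return ("load", link), ("clarify", {letter: link}), or ("none", None)."""
    needle = query.strip().casefold()
    # Assumption: a link matches when the user's words appear in its text.
    matches = [link for link in links if needle and needle in link.text.casefold()]
    if len(matches) == 1:
        # Exactly one match: load it right away (and announce doing so).
        return ("load", matches[0])
    if matches:
        # Several matches: pair each with a shorthand letter for the follow-up.
        # zip() stops at 26 letters; a real browser would need to page beyond that.
        return ("clarify", dict(zip(ascii_lowercase, matches)))
    return ("none", None)
```

With links labelled "Home", "About us", and "About the project", replying "home" would load the first link directly, while "about" would produce a clarification list keyed by `a` and `b`.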
These links would be defined very generally, and include:
These different types of links would be styled differently during the clarification stage so it's clear to users what's part of the browser and what's part of the web. This distinction may be necessary for security, and I'd struggle to maintain it otherwise.