Get Started with Rod

Requirements

Golang is the only requirement; you don't even need to know anything about HTML.

If you have never used Golang, install it first. You can learn the basics in a few hours with A Tour of Go.

First program

Let's use Rod to open a page and take a screenshot of it. First, create a "main.go" file with the content below:

package main

import "github.com/go-rod/rod"

func main() {
    page := rod.New().MustConnect().MustPage("https://www.wikipedia.org/")
    page.MustWaitLoad().MustScreenshot("a.png")
}

rod.New creates a browser object, and MustConnect launches a browser and connects to it. MustPage creates a page object, which is like a tab in the browser. MustWaitLoad waits until the page is fully loaded. MustScreenshot takes a screenshot of the page.

Create a module:

go mod init learn-rod

Run the module:

go run .

The program will output a screenshot "a.png" like the one below:

See what's under the hood

By default, Rod runs the browser without its UI (headless) to maximize performance. But when developing an automation task, we usually care more about ease of debugging. Rod provides several options to help you debug your code.

Let's create a ".rod" config file under the current working directory. The content is:

show

It means "show the browser UI in the foreground". Before we run the module again, let's append time.Sleep(time.Hour) to the end of the code so that the browser doesn't close too quickly for us to see it. The code of "main.go" now becomes:

package main

import (
	"time"

	"github.com/go-rod/rod"
)

func main() {
	page := rod.New().MustConnect().MustPage("https://www.wikipedia.org/")
	page.MustWaitLoad().MustScreenshot("a.png")
	time.Sleep(time.Hour)
}

If you run the module again, you should see a browser like this:

Press CTRL + C on the keyboard to stop the program.

Input and click

Let's automate the website to search for the keyword "earth". A website may have many input fields and buttons, so we need to tell the program which one to manipulate. Usually, we use Devtools to help us locate the element we want to control. Let's append a new config to the ".rod" file to enable Devtools. Now it becomes:

show
devtools

Run "main.go" again, move your mouse over the input field, and right-click it. You will see the context menu; click "Inspect":

You should see the <input id="searchInput" ...> element like below:

Right-click the element and copy its CSS selector, as in the image above. The content on your clipboard will be "#searchInput". We will use it to locate the element to input the keyword. Now "main.go" becomes:

package main

import (
	"time"

	"github.com/go-rod/rod"
)

func main() {
	page := rod.New().MustConnect().MustPage("https://www.wikipedia.org/").MustWindowFullscreen()

	page.MustElement("#searchInput").MustInput("earth")

	page.MustWaitLoad().MustScreenshot("a.png")
	time.Sleep(time.Hour)
}

MustWindowFullscreen resizes the browser window to make it easier to debug. We use MustElement with the selector we copied from the Devtools panel to get the element we want to manipulate. MustElement automatically waits until the element appears, so we don't need to call MustWaitLoad before it. Then we call MustInput to type the keyword "earth" into it. If you rerun "main.go", you should see a result like the one below:

As with the input field, let's right-click the search button to copy its selector:

Then add code to click the search button. Now "main.go" looks like:

package main

import "github.com/go-rod/rod"

func main() {
	page := rod.New().MustConnect().MustPage("https://www.wikipedia.org/").MustWindowFullscreen()

	page.MustElement("#searchInput").MustInput("earth")
	page.MustElement("#search-form > fieldset > button").MustClick()

	page.MustWaitLoad().MustScreenshot("a.png")
}

If we rerun the module, the "a.png" will show the search result:

Slow motion and visual trace

The automated operations are too fast for human eyes to catch. To debug them, we usually enable the slow-motion and visual trace configs. Let's update the ".rod" file:

show
slow=1s
trace

Then rerun the module. Every action will now wait for 1 second before it executes. On the page, you will see the debug trace generated by Rod like below:

As you can see on the search button, Rod creates a mock mouse cursor.

On the console, you will see a trace log like below:

[rod] 2020/11/11 11:11:11 [eval] {"js":"rod.element","params":["#searchInput"]}
[rod] 2020/11/11 11:11:11 [eval] {"js":"rod.visible","this":"input#searchInput"}
[rod] 2020/11/11 11:11:11 [input] scroll into view
[rod] 2020/11/11 11:11:11 [input] input earth
[rod] 2020/11/11 11:11:11 [eval] {"js":"rod.element","params":["#search-form > fieldset > button"]}
[rod] 2020/11/11 11:11:11 [eval] {"js":"rod.visible","this":"button.pure-button.pure-button-primary-progressive"}
[rod] 2020/11/11 11:11:11 [input] scroll into view
[rod] 2020/11/11 11:11:11 [input] left click
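The entries in the ".rod" file are plain flags ("show", "trace") or key=value pairs ("slow=1s"). To make that format concrete, here is a rough sketch of reading such entries into a map. It is not Rod's actual parser: the function name parseOptions is hypothetical, and treating the "slow" value as a Go duration string is an assumption based on the "1s" example above.

```go
package main

import (
	"fmt"
	"strings"
	"time"
)

// parseOptions (hypothetical) splits entries such as "show" or "slow=1s"
// into a key/value map. Entries may be separated by newlines or commas;
// a flag without "=" maps to the empty string.
func parseOptions(s string) map[string]string {
	opts := map[string]string{}
	isSep := func(r rune) bool { return r == '\n' || r == ',' }
	for _, entry := range strings.FieldsFunc(s, isSep) {
		entry = strings.TrimSpace(entry)
		if entry == "" {
			continue
		}
		key, value, _ := strings.Cut(entry, "=")
		opts[key] = value
	}
	return opts
}

func main() {
	opts := parseOptions("show\nslow=1s\ntrace")

	_, show := opts["show"]
	// Assumption: the value of "slow" is a Go duration string.
	slow, err := time.ParseDuration(opts["slow"])
	if err != nil {
		panic(err)
	}
	fmt.Println(show, slow) // prints "true 1s"
}
```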

Other than the ".rod" file

The ".rod" file is just a shortcut for some commonly used APIs. You can also set these options manually in code. For example, the "slow" option corresponds to rod.New().SlowMotion(2 * time.Second). You can also set them with an environment variable, for example on Mac or Linux: rod=show go run .

Read the doc of defaults for more details.

Get text content

Rod provides lots of handy methods to retrieve the contents from the page.

Let's try to get the description of the Earth. Use the same technique as before to copy the selector from Devtools:

The method we use is MustText. Here's the full code:

package main

import (
	"fmt"

	"github.com/go-rod/rod"
)

func main() {
	page := rod.New().MustConnect().MustPage("https://www.wikipedia.org/")

	page.MustElement("#searchInput").MustInput("earth")
	page.MustElement("#search-form > fieldset > button").MustClick()

	el := page.MustElement("#mw-content-text > div.mw-parser-output > p:nth-child(6)")
	fmt.Println(el.MustText())
}

If we rerun the module, we should see the console outputs something like:

Earth is the third planet from the Sun and the only astronomical object known to harbor life.
...

Get image content

As with text, we can also get images from the page. Let's get the selector of the Earth image and use MustResource to get the binary content of the image:

The full code is:

package main

import (
	"github.com/go-rod/rod"
	"github.com/go-rod/rod/lib/utils"
)

func main() {
	page := rod.New().MustConnect().MustPage("https://www.wikipedia.org/")

	page.MustElement("#searchInput").MustInput("earth")
	page.MustElement("#search-form > fieldset > button").MustClick()

	el := page.MustElement("#mw-content-text > div.mw-parser-output > table.infobox > tbody > tr:nth-child(1) > td > a > img")
	_ = utils.OutputFile("b.png", el.MustResource())
}

The output file "b.png" should be:

Next Chapter
