tagui

<img src="https://raw.githubusercontent.com/kelaberetiv/TagUI/master/src/media/tebel_icon.png" height="111" align="right"> # TagUI [Why This](#why-this) | [Set Up](#set-up) | [To Use](#to-use) | [Cheat Sheet](#cheat-sheet) | [Developers](#developers-reference) | [**Tutorial**](https://github.com/kelaberetiv/TagUI/blob/master/src/media/RPA%20Workshop.md) | [**Slides**](https://prezi.com/p/f5vag20tuth8/) | [**Video**](https://www.youtube.com/watch?v=hzE4tKlzzg4) **TagUI is a command-line tool for digital process automation (RPA). It's maintained by [AI Singapore](https://www.aisingapore.org), a government-funded initiative to build local artificial intelligence capabilities. To start, click above tutorial, slides, or video links.** ### FEATURES - automate Chrome visibly or headlessly - visual automation of websites and desktop - write in 20+ human languages & JavaScript - Chrome extension for recording web actions - R & Python integration for big data / AI / ML # Why This The goal of UI (user interface) automation is to reproduce cognitive interactions that you have with websites or your desktop, so that your computer can do it for you, base on your schedule or conditions. TagUI helps you rapidly automate repetitive or time-critical tasks - use cases include digital process automation, data acquisition and testing. Read on for more info or jump right to the [flow samples section](https://github.com/kelaberetiv/TagUI#flow-samples) to see examples of TagUI automation in natural-language-like syntax, which makes automation easy to maintain. This is a full-feature and free open-source tool, so there's nothing to upgrade to or any paid subscription. To feedback suggestions or bugs, [raise an issue on GitHub](https://github.com/kelaberetiv/TagUI/issues). <details> <summary> Click to show differences between TagUI open-source RPA and commercial RPA software </summary> **Key Strengths** - cross-platform, works on Windows, macOS, Linux - increased security as users can view source code - $0 to use, under Apache 2.0 open-source license - easy to use, thus rapid development + deployment - easy for IT policies to deploy, simply unzip and run - native integration with Python and R for AI / ML / DL - easy API calls to Azure / Amazon cognitive services **Neutral Differences** - scripts written in 21 human languages, not flowchart - JavaScript for advanced scripting instead of C# / VB - visual and OCR based automation for desktop apps **Key Weaknesses** - lack of enterprise audit, control, dashboard, reporting - lack of SLA or 24/7 support team for incident-handling - lack of large development teams (30 to 100+ people) - lack of user / developer base grown over many years - lack of consultancies / partners distribution network </details> ### HOW IT WORKS TagUI converts your intentions from various human languages to lines of working JavaScript code that perform UI automation. Under the hood, it uses [Chrome DevTools Protocol](https://chromedevtools.github.io/devtools-protocol/), [SikuliX](http://sikulix.com), [CasperJS](http://casperjs.org), [PhantomJS](http://phantomjs.org) & [SlimerJS](https://slimerjs.org). ``` // sample TagUI flow to log into Typeform and download a survey report https://www.typeform.com click login type username as user@gmail.com type password as 12345678 click btnlogin download https://admin.typeform.com/xxx to report.csv ``` ``` // besides web element identifiers, images of the elements can be used click login_button.png type username_box.png as user@gmail.com ``` ``` // in addition, (x,y) coordinates of user-interface elements can be used click (1200,200) type (800,400) as user@gmail.com ``` <details> <summary> Click to show powerful and user-friendly features of TagUI that come right out of the box </summary> - TagUI will instantly convert the automation flow above into JavaScript code and automatically perform the steps to download a report. Conditions can also be written in natural language for making decisions or checking webpage. No further backend coding or step definition is required. This makes it easy for rapid prototyping, deployment and maintenance of UI automation, whether you are a developer or not. The language engine supports over 20 languages and can be modified or extended easily by users to improve accuracy or add more languages. - The automation flow can be triggered from scheduling, command line (in natural language), API URL, email etc. Everything happens headlessly in the background without seeing any web browser, so that you can continue using the computer or server uninterrupted. Running on a visible web browser is also supported, using Chrome or Firefox browser (see chrome or firefox option below). - API or command calls can be made with a single line to integrate with other services or apps. Continuous integration with CI/CD tools is possible using CasperJS framework and TagUI's Chrome integration. For auditing and personal tracking, there is a report option which can track all your run results in a csv spreadsheet, complete with execution logs in html format. - If you know JavaScript and want to be more expressive, you can even use JavaScript directly in the flow. If not, you will still enjoy friendly but powerful features such as repositories to store your reusable objects, flexible datatables for batch automation, and a Chrome extension which creates automation flows by recording your actions. For rapid prototyping, there's also an interactive live mode for trying out TagUI steps or JavaScript code in real-time. TagUI has built-in integration with Chrome / headless Chrome directly, which also works in live mode. - There is automatic waiting for web elements to appear + error-checking + nesting of JavaScript code blocks. Not forgetting the option to run automation flows hosted online or auto-upload run results online for sharing. TagUI also supports visual automation of website and desktop through built-in integration with SikuliX. Instead of using element identifiers, images can be used to identify user interface elements to interact with. TagUI has native integrations with R and Python for big data and machine learning capabilities. </details> # Set Up TagUI is in v5.0 - it unzips and runs on macOS, Linux, Windows ([link to release notes](https://github.com/kelaberetiv/TagUI/releases)) ### PACKAGED INSTALLATION - TagUI is easy to use right away, in most environments all dependencies are packaged in - To use [visual automation](https://github.com/kelaberetiv/TagUI#visual-automation) on desktop or browser, be sure to have Java JDK v8 (64-bit) or later Platform|macOS|Linux|Windows|Node.js (macOS/Linux) :------:|:---:|:---:|:-----:|:-------------------: Package|[unzip and run](https://github.com/tebelorg/Tump/releases/download/v1.0.0/TagUI_macOS.zip)|[unzip and run](https://github.com/tebelorg/Tump/releases/download/v1.0.0/TagUI_Linux.zip)|[unzip and run](https://github.com/tebelorg/Tump/releases/download/v1.0.0/TagUI_Windows.zip)|[npm install tagui](https://www.npmjs.com/package/tagui) **Recommended locations to unzip to** - Windows - c:\ - macOS - desktop - Linux - /home/your_id > avoid spaces in the folder path as some components of TagUI don't work well with spaces in folder and file names > optional - configure web browser settings in tagui_config.txt, such as browser resolution, step timeout of 10s etc **Tip** - to try [cutting edge version](https://github.com/kelaberetiv/TagUI/compare/v5.0.0...master) with the latest features, [download master.zip](https://github.com/kelaberetiv/TagUI/archive/master.zip) to overwrite your existing packaged installation (be sure to manually select and move the folders & files inside master.zip's TagUI-master/src folder to replace your existing tagui/src folder, some OSes will delete existing target folders that are missing from source folder) ### MANUAL INSTALLATION <details> <summary> Click to show step-by-step setup if you prefer to download dependencies manually from their websites </summary> 1. PhantomJS (headless scriptable web browser) - http://phantomjs.org 2. CasperJS (navigation/testing for PhantomJS) - http://casperjs.org 3. SlimerJS (scriptable web browser for Firefox) - https://slimerjs.org 4. TagUI (general purpose UI automation tool) - https://git.io/vMCTZ 5. SikuliX (required for visual automation features) - http://sikulix.com 6. PHP (only required for manual Windows setup) - http://windows.php.net **Tip** - recommend putting TagUI to a folder path without spaces (some dependencies have issue with that). for manual Windows setup, 1. set SLIMERJS_EXECUTABLE env variable to point to slimerjs.bat, 2. put [GNU utilities](http://unxutils.sourceforge.net) (cut / gawk / grep / head / sort / tail / tee), [curl ssl](https://curl.haxx.se) in tagui\src\unx, 3. add phantomjs\bin, casperjs\bin, php folders to path </details> # To Use ### COMMAND LINE `tagui flow_filename option(s)` for Windows `./tagui flow_filename option(s)` for macOS/Linux - Flow filename can be a local file or the URL of an online file - Filename can have no extension, .txt or .js or .tagui extension - Type tagui without parameters to see its version and options Log files (.log .raw .js) are created after running. To turn off, put an empty file tagui_no_logging in tagui/src folder. > If your command prompt or terminal font size is too small, you can set it to much larger font sizes for easier reading. > Below example will run a script to perform a search on Yahoo website and capture a screenshot of the results. **Windows** - unzip the tagui folder to c:\\. Open command prompt with Start Menu -> type run -> type cmd and enter ``` c: cd c:\tagui\src tagui samples\1_yahoo ``` **macOS** - unzip the tagui folder to your desktop. Open terminal from Apps -> Utilities -> Terminal and enter commands ``` cd /Users/your_id/Desktop/tagui/src ./tagui samples/1_yahoo ``` **Linux** - unzip the tagui folder to a convenient folder on your laptop for example /home/your_id and enter commands ``` cd /home/your_id/tagui/src ./tagui samples/1_yahoo ``` > if the script works successfully, you will notice five .png files - congratulations, you have run your first TagUI script! **Troubleshooting potential exceptions** - For Windows computers, if you see 'MSVCR110.dll is missing' error, install [this from Microsoft website](https://www.microsoft.com/en-us/download/details.aspx?id=30679) (choose vcredist_x86.exe) - this file is required to run the Windows PHP engine packaged with TagUI. - For some newer macOS versions, if you get a 'dyld: Library not loaded' error, [install OpenSSL in this way](https://github.com/kelaberetiv/TagUI/issues/86#issuecomment-372045221). - For some flavours of Linux (Ubuntu for example), which do not have PHP pre-installed, google how to install PHP accordingly (eg Ubuntu, apt-get install php). Most Linux distributions would already come with PHP. - For Firefox automation, download an [older version here](https://ftp.mozilla.org/pub/firefox/releases/59.0/) (SlimerJS doesn't work with Firefox v60 onwards). Now, you can try the same automation script with Chrome browser by running with chrome option (for Windows enter `tagui samples\1_yahoo chrome`, for macOS/Linux enter `./tagui samples/1_yahoo chrome`). The automation will now run in the foreground instead, so you'll be able to see the navigation on Yahoo and DuckDuckGo websites. TagUI can also be run from desktop icons, scheduled tasks, or REST API calls. <details> <summary> Click to show the command line options for TagUI tool and their purposes </summary> Option|Purpose :----:|:------ chrome|run on visible Chrome web browser instead of invisible PhantomJS (first install [Chrome](https://www.google.com/chrome/)) headless|run on invisible Chrome web browser instead of default PhantomJS (first install Chrome) firefox|run on visible Firefox web browser instead of invisible browser (first install [Firefox](https://www.mozilla.org/en-US/firefox/new/)) report|track run result in tagui/src/tagui_report.csv and save html log of automation execution upload|upload automation flow and result to [hastebin.com](https://hastebin.com) (expires 30 days after last view) speed|skip 3-second delay between datatable iterations (and skip restarting of Chrome) quiet|run without output except for explicit output (echo / show / check / errors etc) debug|show run-time backend messages from PhantomJS mode for detailed tracing and logging test|testing with check step test assertions for CI/CD integration (output XUnit XML file) baseline|output execution log and relative-path output files to a separate baseline directory input(s)|add your own parameter(s) to be used in your automation flow as variables p1 to p9 data.csv|specify a csv file to be used as the datatable for batch automation of many records </details> <details> <summary> Click to show info on automation logs, and how to run tagui from any directory </summary> After each automation run, a .log file will be created to store output of the execution, a .js file is the generated JavaScript file, a .raw is the expanded flow after reading in any module sub-scripts that are called in that flow. These files are for user reference purpose and can be helpful in debugging or troubleshooting the automation flow. To turn off creation of these files, put an empty file tagui_no_logging in tagui/src folder. **To run tagui from anywhere** - For MacOS/Linux, use ln -sf /full_path/tagui/src/tagui /usr/local/bin/tagui to create symbolic link - For Windows, add tagui/src [folder to path](http://lmgtfy.com/?q=add+to+path+in+windows), then tagui will be accessible from any folder </details> <details> <summary> Click to show how to run TagUI scripts by double-clicking as desktop icons </summary> To do that on Windows, create a .cmd or .bat file with contents like the following, which goes to the directory where you want to run the automation, and run tagui command on the file with your specified options. Double-clicking the .cmd or .bat file will start automation. You can also [associate .tagui files directly](https://www.digitaltrends.com/computing/how-to-set-default-programs-and-file-types-in-windows-10) to be opened by tagui\src\tagui.cmd command. ``` @echo off c: cd c:\folder tagui filename quiet speed chrome ``` To do that on macOS / Linux, create a file with contents like the following, which goes to the directory where you want to run the automation, and run tagui command on the file with your specified options. You will need to use the command chmod 700 on the file to give it execute permissions, so that it can be run by double-clicking on it. ``` cd /Users/username/folder tagui filename quiet speed chrome ``` </details> ### BY SCHEDULING To schedule an automation flow with crontab (macOS/Linux), for example at 8am daily ``` 0 8 * * * /full_path_on_your_server/tagui flow_filename option(s) ``` **Tip** - for Windows, use Task Scheduler (search schedule from Start Menu) or [Z-Cron freeware](https://www.z-cron.com) ### TAGUI WRITER, SCREENSHOTER & EDITOR TagUI Writer is a Windows app created by Arnaud Degardin / [@adegard](https://github.com/adegard) which makes it easy to write TagUI scripts. By pressing Ctrl + Left-click, a popup menu will appear with the list of TagUI steps for you to paste into your text editor. Arnaud also created a ScreenShoter app which makes it easy to capture snapshots for TagUI visual automation. Lastly, TagUI Editor allows you to edit and run TagUI scripts via AutoHotKey. To download, [click here](https://github.com/adegard/tagui_scripts). <details> <summary> Click to show a preview of how TagUI Editor can be used to edit and run TagUI scripts </summary> ![TagUI Editor](https://raw.githubusercontent.com/adegard/tagui_scripts/master/TagUI_Editor.gif) </details> ### CHROME EXTENSION Download from [Chrome Web Store](https://chrome.google.com/webstore/detail/tagui-web-automation/egdllmehgfgjebhlkjmcnhiocfcidnjk/) to use TagUI Chrome web browser extension for recording automation flows. TagUI Chrome extension records steps such as page navigation, clicking web elements and entering information. <details> <summary> Click to show details on TagUI Chrome extension (based on https://github.com/ebrehault/resurrectio) </summary> **To start recording automation flows** 1. Go to the website URL you want to start the automation at 2. Click the TagUI icon, followed by the Start button 3. Carry out the steps you want to automate 4. Click the TagUI icon, followed by the Stop, then Export buttons to view the generated TagUI script > while recording the steps, you can right click to bring up a menu for steps such as showing the element identifier The recording isn't foolproof (for example, the underlying recording engine cannot capture frames, popup windows or tab key input). It's meant to simplify flow creation with some edits, instead of typing everything manually. [See this video](https://www.youtube.com/watch?v=bFvsc4a8hWQ) for an example of recording a sequence of steps, editing for adjustments and playing back the automation. </details> ### VISUAL AUTOMATION TagUI has built-in integration with [SikuliX (base on OpenCV)](http://sikulix.com) to allow visually identifying the web elements and desktop UI (user interface) elements for interaction by providing their images or (x,y) coordinates. Applicable steps are click, hover, type, select, read, show, save, snap, keyboard, mouse. To use visual automation, Java JDK v8 (64-bit) or later is required. <details> <summary> Click to show how to download and check for Java, how to use visual automation, and a demo video </summary> 1. Check that [Java JDK (64-bit)](http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html) is installed (entering `java -version` returns your Java version) 2. After Java is installed, you will have to restart your command prompt or terminal to use it 3. On Windows, set display magnification to recommended %, if it doesn't work then 100% 4. On Windows, if TagUI just hangs there, see if it's due to [this issue and try the solution](https://github.com/kelaberetiv/TagUI/issues/229) 5. On macOS, if can't find image on screen, may be due to [how the image was captured](https://github.com/kelaberetiv/TagUI/issues/240#issuecomment-405030276) 6. On Linux, requires installing and setting up dependencies by following [this guide](https://sikulix-2014.readthedocs.io/en/latest/newslinux.html#version-1-1-4-special-for-linux-people) To use visual automation, simply specify an image (in .png or .bmp format) to visually look for in place of the element identifier. Alternatively, you can specify the (x,y) coordinates of the element that you want to interact with. > Important! The element that corresponds to the image must be visible on the screen for visual automation to succeed. If it's blocked by another window for example, the automation will be unable to find the element. To type onto the screen instead of a particular element, use `keyboard text` or `keyboard [modifiers]text` ([examples](https://github.com/kelaberetiv/TagUI/issues/370)). To do a snapshot or an OCR of the whole screen, use `page.png` or `page.bmp` as the element identifier for steps snap / read. Relative paths are supported for image filenames (eg pc.png, images/button.bmp). The usual helper functions visible() / present() can also be used to check whether an image is visible on the screen. The keyboard and mouse steps, as well as helper functions mouse_xy(), mouse_x(), mouse_y(), can be used to do complex UI interactions. A screen (real or Xvfb) is needed for visual automation. [Tesseract OCR](https://github.com/tesseract-ocr/tesseract) (optical character recognition) is used for visually retrieving text. Also, by using vision step, you can send [custom SikuliX commands](http://sikulix-2014.readthedocs.io/en/latest/genindex.html) to do things that are not covered by TagUI. ![Sample Visual Automation](https://raw.githubusercontent.com/tebelorg/Tump/master/visual_flow.gif) </details> ### NATIVE LANGUAGES To run TagUI flows in native languages or output flow execution in other languages ([see demo run](https://github.com/kelaberetiv/TagUI/issues/68#issuecomment-344380657)) 1. set your default flow language with [tagui_language variable](https://github.com/kelaberetiv/TagUI/blob/master/src/tagui_config.txt) in tagui_config.txt 2. write automation flow in native language base on [language definition .csv files](https://github.com/kelaberetiv/TagUI/tree/master/src/languages) 3. optional - set tagui_language in flow to any other languages as output language <details> <summary> Click to show the 20+ human languages supported by TagUI and how to self-build language definitions </summary> **Tip** - as Windows Unicode support isn't as straightforward as macOS/Linux, doing this in Windows may require changing system locale, using chcp command, and selecting a font to display native language correctly ([more info](http://www.walkernews.net/2013/05/19/how-to-get-windows-command-prompt-displays-chinese-characters/)) TagUI language engine supports over 20 languages and can be modified or extended easily by users to improve accuracy or add more languages. The languages are Bengali, Chinese, English, French, German, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Polish, Portuguese, Romanian, Russian, Serbian, Spanish, Tagalog, Tamil, Thai, Vietnamese. This starting set is partly chosen base on the [list of most commonly used languages](https://www.babbel.com/en/magazine/the-10-most-spoken-languages-in-the-world), partly from the countries around where I'm from (Singapore), and partly from countries with a lot of developers. If your native language isn't in the above list, you can also automate building a new native language definition by using this language [build automation flow](https://github.com/kelaberetiv/TagUI/blob/master/src/languages/build) (src/languages/build) that self-builds the vocabulary set using Google Translate. To do that, update [build.csv](https://github.com/kelaberetiv/TagUI/blob/master/src/languages/build.csv) with the languages that you want to build and run `tagui build using chrome` in src/languages folder. Use quiet option to hide the verbose automation output. The generated files are named as their 2-character language codes to prevent overwriting existing language definitions by accident. To use the generated .csv files, rename them to their full language names. See [full list of languages possible](https://cloud.google.com/translate/docs/languages) to be generated by Google Translate. Most of the language definitions are automatically self-built using Google Translate (except english.csv, chinese.csv, japanese.csv, german.csv, french.csv which have been vetted), and would be wrong without understanding vocabulary used in UI interaction context. Native language users can update the language definition csv themselves and are invited to submit PRs with correct words to be used. Some languages are very different from English structure (for eg, written from right to left, different order of adjective and noun) and would be impossible to use correctly in TagUI. </details> ### PYTHON INTEGRATION TagUI has built-in integration with Python (works out of the box for both v2 & v3) - a programming language with many popular frameworks for big data and machine learning. The py step can be used to run commands in Python and retrieve the output of those commands. To use Python integration in TagUI, first [download Python for your OS](https://www.python.org/). macOS and Linux normally come pre-installed with Python. Make sure that python command is accessible from command prompt. <details> <summary> Click to show how to use py step in your automation flow to send and receive data from Python frameworks </summary> In your automation flow, use the py step followed by the Python commands to be executed, separated by `;`. You can then use the `print` command in Python to output the result you want to be accessible in your automation flow as `py_result` variable. If the result is JSON data, the JSON object `py_json` will be created for easy access to JSON data elements. If not, `py_json` will be null. ``` // using py step to denote Python code, and getting back output from py_result py a=1;b=2 py c=a+b py print(c) echo py_result // alternatively, you can use py begin and py finish to denote a Python code block // indentation of Python code is also supported, for example in conditions or loops py begin a=1;b=2 c=a+b print(c) py finish echo py_result // an example of passing dynamically generated variables to Python integration phone = 1234567 name = 'donald duck' py_step('phone = ' + phone) py_step('name = "' + name + '"') py print(name) echo py_result py print(phone) echo py_result ``` You can also use the `execfile()` command in Python v2.X to run Python scripts. Or use `exec(open('filename').read())` in [Python v3.X to run Python scripts](https://stackoverflow.com/questions/436198/what-is-an-alternative-to-execfile-in-python-3). For examples of using Python for machine learning, check out this [essentials of machine learning algorithms](https://www.analyticsvidhya.com/blog/2017/09/common-machine-learning-algorithms/) article or this article on [Python deep learning frameworks](https://www.kdnuggets.com/2017/02/python-deep-learning-frameworks-overview.html). </details> ### R INTEGRATION TagUI has built-in integration with R - an open-source software environment for statistical computing and graphics. R can be used for big data and machine learning. The r step can be used to run commands in R and retrieve the output of those commands. To use R integration in TagUI, first [download R software for your OS](https://www.r-project.org/). Make sure that Rscript command is accessible from your command prompt (added to path or symbolically linked). <details> <summary> Click to show how to use r step in your automation flow to send and receive data from R frameworks </summary> In your automation flow, use the r step followed by the R commands to be executed, separated by `;`. You can then use the `cat()` command in R to output the result you want to be accessible in your automation flow as `r_result` variable. If the result is JSON data, the JSON object `r_json` will be created for easy access to JSON data elements. If not, `r_json` will be null. ``` // using r step to denote R code, and getting back output from r_result r a=1;b=2 r c=a+b r cat(c) echo r_result // alternatively, you can use r begin and r finish to denote a R code block r begin a=1;b=2 c=a+b cat(c) r finish echo r_result // an example of passing dynamically generated variables to R integration phone = 1234567 name = 'donald duck' r_step('phone = ' + phone) r_step('name = "' + name + '"') r cat(name) echo r_result r cat(phone) echo r_result ``` You can also use the `source()` command in R to run R scripts. For examples of using R for machine learning, check out this [essentials of machine learning algorithms](https://www.analyticsvidhya.com/blog/2017/09/common-machine-learning-algorithms/) article or this [guerilla guide to machine learning](https://www.kdnuggets.com/2017/05/guerrilla-guide-machine-learning-r.html) video series. </details> ### CLI ASSISTANT TagUI scripts are already in natural-language-like syntax to convert to JavaScript code. What's even better is having natural-language-like syntax on the command line. Instead of typing `tagui download_bank_report june creditcard` to run the automation flow download_bank_report with parameters june creditcard, you can type `erina download my june creditcard bank report`. For a demo of the CLI (command-line interface) assistant in action, [see this video](https://www.youtube.com/watch?v=Sm4WNQ89gRA). <details> <summary> Click to show details on how you can rename your CLI assistant and the syntax used to invoke automations </summary> The commands erina (macOS/Linux) and erina.cmd (Windows) can be renamed to any other names you like. The commands can be set up in the same way as the tagui / tagui.cmd above to be accessible from any folder. The command basically interprets this general syntax `erina single-word-action fillers options/parameters fillers single-or-multi-word-context` to call run the corresponding automation flow `action_context` with `options/parameters`. Also, adding `using chrome` / `using headless` / `using firefox` at the end will let it run using the respective browsers. The default location where automation flows are searched for is in tagui/flows folder and can be changed in tagui_helper.php. Filler words (is, are, was, were, my, me) are ignored as they don't convey important information ([more design info](https://github.com/kelaberetiv/TagUI/issues/44#issuecomment-321108786)). </details> ### FLOW SAMPLES Following automation flow samples ([tagui/src/samples folder](https://github.com/kelaberetiv/TagUI/tree/master/src/samples)) are included with TagUI Flow Sample |Purpose :-----------|:------ [1_yahoo](https://github.com/kelaberetiv/TagUI/blob/master/src/samples/1_yahoo)|searches github on Yahoo and captures screenshot of results [2_twitter](https://github.com/kelaberetiv/TagUI/blob/master/src/samples/2_twitter)|goes to a Twitter page and saves some profile information [3_github](https://github.com/kelaberetiv/TagUI/blob/master/src/samples/3_github)|goes to a GitHub page and downloads the repository file [4_conditions](https://github.com/kelaberetiv/TagUI/blob/master/src/samples/4_conditions)|goes through examples of using conditions in natural language [5_repositories](https://github.com/kelaberetiv/TagUI/blob/master/src/samples/5_repositories)|shows using repositories on Russian social media site VK.com [6_datatables](https://github.com/kelaberetiv/TagUI/tree/master/src/samples/6_datatables)|set of flows uses datatables to retrieve and act on GitHub info [7_testing](https://github.com/kelaberetiv/TagUI/blob/master/src/samples/7_testing)|shows how to use check step assertions for CI/CD integration [8_hastebin](https://github.com/kelaberetiv/TagUI/blob/master/src/samples/8_hastebin)|used by upload option to upload flow result to hastebin.com [9_misc](https://github.com/kelaberetiv/TagUI/blob/master/src/samples/9_misc)|shows how to use steps popup, frame, dom, js, { and } block [a_facedetect](https://github.com/kelaberetiv/TagUI/blob/master/src/samples/a_facedetect)|uses face recognition to detect profile images on webpages [b_visualoutlook](https://github.com/kelaberetiv/TagUI/issues/113#issuecomment-378194245)|uses visual recognition for desktop MS Outlook email sending [c_chineseflow](https://github.com/kelaberetiv/TagUI/blob/master/src/samples/c_chineseflow)|run flow in other languages (first, change src/tagui_config.txt) # Cheat Sheet ### AUTOMATION WORKFLOW - What happens behind the scenes when you run an automation flow ![TagUI Flowchart](https://raw.githubusercontent.com/kelaberetiv/TagUI/master/src/media/flowchart.png) ### FIND XPATH OF WEB ELEMENT - In Chrome browser, right-click on the element, click Inspect, right-click on HTML code block, then ![TagUI Flowchart](https://raw.githubusercontent.com/kelaberetiv/TagUI/master/src/media/find_xpath.png) ### STEPS DESCRIPTION - TagUI auto-waits for a webpage element to appear and interacts with it as soon as it appears - Element identifier can be auto-recorded using TagUI Chrome extension, or [found from web browser](https://github.com/kelaberetiv/TagUI#find-xpath-of-web-element) - Identifiers help to pinpoint which webpage elements you want to interact with ([examples in flow samples](https://github.com/kelaberetiv/TagUI#flow-samples)) - TagUI auto-selects provided identifier in this order - xpath, css, id, name, class, title, aria-label, text(), href Basic Step|Parameters (separator in bold)|Purpose :---------|:-----------------------------|:------ http(s)://|just enter full url of webpage (\`variable\` for variable)|go to specified webpage click|element to click|click on an element rclick|element to right-click|right-click on an element dclick|element to double-click|double-click on an element hover|element to hover|move cursor to element type|element ***as*** text ([enter] = enter, [clear] = clear field)|enter element as text select|element to select ***as*** option value ([clear] = clear selection)|choose dropdown option read|element to read (page = webpage) ***to*** variable name|fetch element text to variable show|element to read (page = webpage, ie raw html) |print element text to output save|element (page = webpage) ***to*** optional filename|save element text to file snap|element (page = webpage) ***to*** optional filename|save screenshot to file snap (pdf)|page ***to*** filename.pdf (headless Chrome / PhantomJS)|save webpage to basic pdf load|filename ***to*** variable name|load file content to variable echo|text (in quotation marks) and variables|print text/variables to output dump|text (in quotation marks) and variables ***to*** optional filename|save text/variables to file write|text (in quotation marks) and variables ***to*** optional filename|append text/variables to file variable_name| = value (for text, put in quotes, use + to concat)|define variable variable_name // (on new line)|user comments (ignored during execution)|add user comments ask|question or instruction for user (reply stored in ask_result)|ask user for input live|try steps or code interactively for Chrome / visual automation|enter live mode ([Firefox not yet](https://github.com/laurentj/slimerjs/issues/639)) **Tip** - to use variables where text is expected, \`variable\` can be used. XPath is an expressive way to identify web elements. If you know xpath and use xpath for element identifier, use double quotes for text //\*[@title="Login"] <details> <summary> Click to show pro steps such as tagui, keyboard, mouse, table, wait, check, api, run, dom, js, r, py, vision, code blocks </summary> Pro Step|Parameters (separator in bold)|Purpose :-------|:-----------------------------|:------ tagui|relative or absolute filename (see MODULES section)|run another tagui flow keyboard|keystrokes and modifiers (using visual automation)|send keystrokes to screen mouse|down or up (using sikuli visual automation)|send mouse event to screen table|element (XPath selector only) ***to*** optional filename.csv|save basic html table to csv wait|optional time in seconds (default is 5 seconds)|explicitly wait for some time check|condition **|** text (in quotes) if true **|** text (in quotes) if false|check condition and print result upload|element (CSS selector only) ***as*** filename to upload|upload file to website download|url to download ***to*** filename to save|download from url to file receive|url keyword to watch ***to*** filename to save|receive resource to file frame|frame name **|** subframe name if any|next step or block in frame/subframe popup|url keyword of new tab window to look for|next step or block in new tab window { and }|use { to start block and } to end block (on new line)|define block of steps and code api|full url (including parameters) of api call|call api & save response to api_result run|OS shell command including parameters|run OS command & save to run_result dom|javascript code for document object model|run code in dom & save to dom_result js|javascript statements (skip auto-detection)|treat as JS code explicitly r|R statements for big data and machine learning|run R statements & save to r_result py|python code for big data and machine learning|run python code & save to py_result vision|custom visual automation commands|run custom sikuli commands timeout|time in seconds before step errors out|change auto-wait timeout **Tip** - for headless and visible Chrome, file downloads can be done using normal webpage interaction or specifying the URL as a navigation flow step. For Firefox and PhantomJS, the download and receive step can be used. > on Windows, snap step requires display magnification to be set at 100% to work properly As TagUI default execution context is local, to run javascript on webpage dom (eg document.querySelector) use dom step. Set dom_json variable to pass a variable for use in dom step. Or dom_json = {tmp_number: phone, tmp_text: name} to pass multiple variables for use in dom step (dom_json.tmp_number and dom_json.tmp_text). > For steps run, dom, js, r, py, vision, instead of typing the step and the command, you can use something like py begin followed by many lines of py code, and end with py finish to denote an entire code block. This saves typing the step repeatedly for a large integration code block. Indentation of Python code within py begin-finish and vision begin-finish blocks is supported, for example in conditions or loops. For steps r, py, vision, the helper functions r_step(), py_step(), vision_step() can be used to pass dynamic variables to those integrations. Below is an example for py step for passing variables from TagUI to Python integration. ``` phone = 1234567 name = 'donald duck' py_step('phone = ' + phone) py_step('name = "' + name + '"') py print(name) echo py_result py print(phone) echo py_result ``` </details> ### CONDITIONS EXAMPLES <details> <summary> Click to show how conditions can be expressed in natural language (optional brackets) or JavaScript </summary> - Conditions help in decision-making and taking different actions base on run-time context - Write text in quotation marks (either " or ' works) to differentiate text from variable names - { and } block is required after for loop (while is deprecated - no auto-wait, hangs CasperJS) - Use for loop to repeat blocks of steps (nested loops, conditions, break, continue supported) Condition (in natural language)|JavaScript :------------------------------|:--------- example - if day equals to "Friday"| if (day == "Friday") example - if menu contains "fruits"| if (menu.indexOf("fruits")>-1) example - if A more than B and C not equals to D | if ((A > B) && (C != D)) example - for n from 1 to 4 | for (n=1; n<=4; n++)] example - for n from 1 to infinity | for (n=1; n<=1024; n++)] example - while cupcakes equal to 12| while (cupcakes == 12) contain|.indexOf("text")>-1 not contain|.indexOf("text")\<0 equal to|== not equal to|!= more than / greater than / higher than|> more than or equal to / greater than or equal to / higher than or equal to|>= less than / lesser than / lower than|< less than or equal to / lesser than or equal to / lower than or equal to|<= and|&& or||| **Tip** - use { and } step to define step/code blocks for powerful repetitive automation with for loop. The usual break and continue commands can be used on next line after an if condition, to break out of the immediate loop or continue to the next iteration. To loop 'indefinitely' use `for n from 1 to infinity`, where infinity is a pre-defined variable at 1024. When using contain / equal, you can write with or without s behind. You can use if present('element') to check if the element exists, before doing the step on next line. Other useful functions include visible('element'), count('element'), url(), title(), text(), timer(), which can be used in conditions and steps such as check or echo. </details> ### HELPER FUNCTIONS <details> <summary> Click to show csv_row(), present(), visible(), count(), url(), title(), text(), timer(), mouse_xy(), mouse_x(), mouse_y() </summary> - Below are helper functions which can be used in your steps or code like a variable - You can define your own JS functions in tagui_local.js / tagui_global.js ([see eg](https://github.com/kelaberetiv/TagUI/issues/236#issuecomment-403188526)) - TagUI will look for a local function file tagui_local.js in the same folder of flow - TagUI will also look for a global function file tagui_global.js in tagui/src folder - Local definitions take precedence over global definitions (same for repositories) Function|Purpose :-------|:------ csv_row(row_array)|return formatted string for writing to csv file, eg using write step present('element')|return true or false whether element identifier specified is present visible('element')|return true or false whether element identifier specified is visible count('element')|return number of elements matching element identifier specified url()|return page url of current web page title()|return page title of current web page text()|return text content of current web page timer()|return time elapsed in sec between calls mouse_xy()|return (x,y) coordinates as string, for eg '(200,400)' mouse_x()|return x coordinate as integer number, for eg 200 mouse_y()|return y coordinate as integer number, for eg 200 ``` // example of using csv_row() function to save data as CSV file read name_element to name read price_element to price read details_element to details item_info = [name, price, details] write csv_row(item_info) to product_list.csv ``` **Tip** - mouse_xy(), mouse_x(), mouse_y() require visual automation and can be used to interact with UI (user-interface) elements on the screen by specifying their (x,y) coordinates, eg clicking to the right of an UI element by 200 pixels - ``` // example of using mouse_xy(), mouse_x(), mouse_y() functions hover element.png echo mouse_xy() x = mouse_x() + 200 y = mouse_y() click (`x`,`y`) ``` </details> ### MODULES <details> <summary> Click to show how you can call other TagUI automation flows within an automation flow file </summary> - You can reuse and combine automation scripts to build complex flows - Sub-scripts can be multiple levels deep and be of any filename or extension - To call sub-scripts, use tagui step followed by absolute or relative filename - For example, tagui login_crm or tagui crm.login or tagui outlook\sendmail - Variables can be directly used or modified in parent script and sub-scripts **Tip** - tagui step works by expanding content of a sub-script into the parent script, at the line where tagui step is used to call the sub-script. Thus variables that are accessible from the parent flow file will also be accessible from the sub-script. A .raw file will be created to store expanded contents of the automation flow (useful for checking error messages). Alternatively, try using run step to run tagui on a flow file, so that it's run as a separate child process. </details> ### REPOSITORIES <details> <summary> Click to show how you can use repositories to make objects reusable and improve readability </summary> - Repository csv files have 2 columns, for example below (headers up to you to name) - TagUI will look for a local repository file tagui_local.csv in the same folder of flow - TagUI will also look for a global repository file tagui_global.csv in tagui/src folder - Using \`object\` in your flow replaces it with its definition (which can contain objects) - For example, \`type email\` becomes type user-email-textbox as user@gmail.com OBJECT|DEFINITION :-----|:--------- email|user-email-textbox create account|btn btn--green btn-xl signup-btn type email|type \`email\` as user@gmail.com </details> ### DATATABLES <details> <summary> Click to show how datatable csv files can be used to perform batch automation at scale </summary> - When running TagUI, specify the csv file to use, eg tagui flow_filename trade_data.csv - TagUI loops through each row to run automation using the data from different rows - Using \`column_name\` in your flow replaces it with its value for that iteration - Eg, echo 'TESTCASE - \`testname\`' in your flow shows TESTCASE - Trade USDSGD - echo '\`[iteration]\`' can be used in your flow to show the current iteration number \#|testname|username|password|pair|size|direction :-|:-------|:-------|:-------|:---|:---|:-------- 1 |Trade USDSGD|test_account|12345678|USDSGD|10000|BUY 2 |Trade USDSGD|test_account|12345678|USDJPY|1000|SELL 3 |Trade EURUSD|test_account|12345678|EURUSD|100000|BUY **Tip** - using speed option will skip the 3-second delay between iterations (and skip restarting of Chrome browser) </details> ### AUDIT & REPORTING <details> <summary> Click to show how the report option can be used to track and store automation results </summary> - when report option is used, TagUI will track run results to tagui/src/tagui_report.csv - this can be useful for personal tracking, auditing purpose, or measuring time savings - html logs will also be generated to store the full execution sequences and outcomes \#|AUTOMATION FLOW|START TIME|FINISH TIME|ERROR STATUS|LOG FILE :-|:--------------|:---------|:----------|:-----------|:------- 1 | /Users/kensoh/Desktop/download_flow | Sun Apr 07 2019 17:33:58 GMT+0800 (+08) | 32.1 | SUCCESS | /Users/kensoh/Desktop/download_flow_1.html 2 | /Users/kensoh/Desktop/upload_flow | NOT STARTED | NOT FINISHED | [LINE 1] cannot understand step self-destruct | /Users/kensoh/Desktop/upload_flow_2.html 3 | /Users/kensoh/Desktop/update_flow | NOT STARTED | NOT FINISHED | [LINE 1] cannot understand step reboot computer | /Users/kensoh/Desktop/update_flow_3.html 4 | /Users/kensoh/Desktop/booking_flow | Sun Apr 07 2019 17:39:00 GMT+0800 (+08) | NOT FINISHED | cannot find confirm_booking | /Users/kensoh/Desktop/booking_flow_4.html </details> # Developers Reference TagUI excels in automating user-interface interactions. It's designed to make prototyping, deployment and maintenance of UI automation easier by minimizing iteration time for each phase. Originally developed by a test automation engineer to be liberated from writing chunks of repetitive code when automating web interactions. ### API <details> <summary> Click to show how to use API webservice that comes with TagUI and how to make complex outgoing API calls </summary> Automation flows can also be triggered via API URL. TagUI has an API service and runner for managing a queue of incoming requests via API. To set up, add a crontab entry on your server with the desired frequency to check and process incoming service requests. Below will check every 15 minutes and run pending flows in the queue. If there's an automation in progress, TagUI will wait for the next check instead of concurrently starting a new run. ``` 0,15,30,45 * * * * /full_path_on_your_server/tagui_crontab ``` To call an automation flow from your application or web browser, use below API syntax. Custom input(s) supported. The default folder to look for flow_filename would be in tagui/src folder. Automation flows can also be triggered from emails using the API. For email integration, [check out Tmail](https://github.com/tebelorg/Tmail). It's an open-source mailbot to act on incoming emails or perform mass emailing; it also delivers emails by API. Emails with run-time variables can be sent directly from your flow with a single line (see flow sample 6C_datatables). For sending SMS, you can use api step to send through SMS-sending services, or host your own from your webserver using services like Skebby ([see this example](https://github.com/adegard/SMS-Marketing) from [@adegard](https://github.com/adegard)). If you have data transformation in your process pipeline [check out TLE](https://github.com/tebelorg/TLE), which can help with converting data. ``` your_website_url/tagui_service.php?SETTINGS="flow_filename option(s)" ``` Besides integrating with web applications, TagUI can be extended to integrate with hardware (eg Arduino or Raspberry Pi) for physical world interactions or machine learning service providers for AI decision-making ability. Input parameters can be sent to an automation flow to be used as variables p1 to p9. Output parameters from an automation flow can be sent to your Arduino or application API URL (see samples 3_github and 6C_datatables). For making outgoing API calls in your automation flow, to feed data somewhere or send emails etc, use the api step followed by full URL (including parameters) of the API call. Raw response from API will be saved in `api_result` variable. If the API response is JSON data, the variable `api_json` will be created for easy access to JSON data elements. If not, `api_json` will be null. For example, `api_json.parent_element.child_element` retrieves value of child_element. ```js api_config = {method:'PUT', header:['Header1: value1','Header2: value2'], body:{'id':123,'pwd':'abc'}}; ``` For advanced API calls, you can set above api_config variable which defaults as `{method:'GET', header:[], body:{}}`. Besides GET, you can use other methods such as POST, PUT, DELETE etc. You can define multiple headers in the format `'Header_name: header_value'` and provide a payload body for PUT requests for example. You can set it as above before using the api step, or set using `api_config.method = 'PUT';` and `api_config.header[0] = 'Header1: value1';` etc. `api_config.body` will be automatically converted to JSON format for sending to API endpoint. In addition, macOS/Linux/Windows commands can be executed using run step. This can be used to call other apps or services hosted locally on the OS, instead of being hosted through webservices. To use