Email Discoverer. Frequently Asked Questions

  1. What are the possible sources of information for Email Discoverer?
  2. How to start using Email Discoverer?
  3. How to create a task?
  4. How to extract email addresses from a website?
  5. What is the deep crawling?
  6. How to extract email addresses from the search engine (Google, Yahoo etc.) results?
  7. How to extract email addresses from a PDF file on the web?
  8. How to extract email addresses from a local file/directory/folder?
  9. What are the threads?
  10. Why do I need to change the number of threads?
  11. What is the meaning of thread priority?
  12. How to control the number of results coming from a search engine?
  13. How to automatically delete duplicate emails?
  14. Which search engines are currently supported?
  15. How to export extracted email addresses?

1. What are the possible sources of information for Email Discoverer?

The Email Discoverer can currently operate with four sources of information:

  • any web site
  • search results generated by a search engine (Google, Yahoo, Live, Yandex) in response to any entered key words
  • PDF file on the web
  • any local file/directory/folder

2. How to start using Email Discoverer?

Please start the software, create a new Task using New button and add either Web sub-task, Key word sub-task or Local file sub-task. The Email Discoverer has a number of template Tasks created in order to demonstrate its functionality.

3. How to create a task?

Press the New button and then in the opened window select its name indicated in the Name field. There are three buttons with '+' sign allowing to add either Web sub-task, Key word sub-task or Local file/Directory sub-task. The Email Discoverer has a number of template Tasks created as an example.

4. How to extract email addresses from a website?

Press the New button and then in the opened window select its name indicated in the Name field. Press 'W+' button and add the name of the website which you would like to use as a source of information. The Email Discoverer has a number of template Tasks created as an example.

5. What is the deep crawling?

The deep crawling allows to process not only a single or individual web page but surf across the website and extract any email addresses encountered within the URL domain. Thus, deep crawling allows to do in top-down direction of the website hierarchy. This process is under control thanks to the parameter called the "Maximum search depth" shown below:

6. How to extract email addresses from the search engine (Google, Yahoo etc.) results?

Press the New button and then in the opened window select its name indicated in the Name field. Press 'K+' button and add the key words which you are interested in. All the results generated by the search engines will be used as a source of information. The Email Discoverer has a number of template Tasks demonstrating this kind of tasks. The serach engines which have to be used in your tasks can be selected in the windows shown below:

7. How to extract email addresses from a PDF file on the web?

Press the New button and then in the opened window select its name indicated in the Name field. Press 'W+' button and add the URL path to the PDF file which you would like to use as a source of information. The Email Discoverer installed by default has an example of Task demonstrating this functionality.

8. How to extract email addresses from a local file/directory/folder?

Press the New button and then in the opened window select its name indicated in the Name field. Press 'F+' button and add local Path to the file which you would like to use as a source of information. The Email Discoverer installed by default has an example of Task demonstrating this functionality.

9. What are the threads?

The Email Discoverer is a multi-threading application. ie. it allows to use several crawling processes in parallel in order to take an advantage of modern multi-core computers.

10. Why do I need to change the number of threads?

You may want to change the number of threads in order to either speed up or slow down the crawling process. For example, if your computer has more than 1 processor you can set the number of threads to something more than 1 and as a result the crawling process will be performed at least 2 times faster. The number of threads can be changed as shown below:

11. What is the meaning of thread priority?

Each thread may have different priorities ranging from low to high. If the priority is low, other applications which you are currently running on your computer will have an advantage over Email Discoverer and vice versa. The priority can be changed as shown below:

12. How to control the number of results coming from a search engine?

The number of results coming from the search engine is under control thanks to the parameter which can be specified as shown below:

13. How to automatically delete duplicate emails?

In order to delete duplicate emails please either select the check button in Task's settings as shown below:

14. Which search engines are currently supported?

The Email Discoverer currently supports Google, Yahoo, Live, Yandex. More serach engines can be added by request. Please send you suggestions to our Forum here http://forum.ssa-outsourcing.com/

15. How to export extracted email addresses?

In order to be able to export email addresses, the Email Discoverer provides an integrated tool allowing to save found emails in CSV format: