Content Discovery Flashcards

1
Q

What is Content Discovery?

A

When speaking of Content Discovery in a Hacker sense, we’re interested in the content that we can’t immediately see on the page. Some examples of things that we’d be looking for during Content Discovery would be: pages or portals used for staff, older versions of the site, backup and configuration files, and even admin panels.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What’re the three ways that we’ll be doing Content Discovery?

A

Automated, Manually, and OSINT(Open-Source Intelligence)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Manual Discovery: What are Robots.txt files?

A

This is a text document that tells search engines which pages they are and aren’t allowed to show on their search engine results or they can say to ban specific search engines from crawling the website altogether.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Manual Discovery Information:

A

Each of these methods have multiple ways of executing them.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why are Robot.txt files common?

A

It’s common to restrict certain websites areas like admin portals so that not just anyone can simple access them from a search engine.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Manual Discovery: What are Sitemap.xml?

A

Sitemap.xml is basically the opposite of Robots.txt, it gives a list of every file the website owner wishes to be displayed by search engines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Manual Discovery: What are Framework Stacks?

A

As established before, when you discover the type of framework that you’re working with, you’ll be able to also find its inherent vulnerability. This vulnerabilities are often tracked on Frameworks Stacks, basically just sites that house this information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Manual Discovery: What are HTTP Headers?

A

HTTP Headers are sent back to the client when you request something from the web server.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

OSINT: What is Google Hacking/Docking?

A

This method is used through utilizing advanced search engine filters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Types of Search Engine Filters:

A

site: - Returns results only from the specified website address

inurl: - Returns results that have specified word in the URL

filetype:- Returns results which are a particular file extension

intitle:- Returns results that contain the specified word in the title

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

OSINT: What is Wappalyzer?

A

Wappalyzer is an online tool o browser extension thats used to help identify all technologies used on a website.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

OSINT: What’s are S3 Buckets?

A

S3 buckets are an external program provided by AmazonAWS that allow you to save files and even make pages dormant on the cloud to be accessed by HTTP or HTTPS.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What’s Automated Discovery?

A

Automated Discovery is the process of using tools to do content discovery as a pose too doing it manually. These tools will be automated because they contain hundreds, thousands, even millions of request to a web server.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Automated Discovery: What do these request do?

A

These request will check whether or not a file or directory exist on a website, and this process is made possible by use of a tool called wordlists.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Automated Discovery: What’s a Wordlist?

A

Wordlists are just text files thats contain a long-list of commonly used words.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are automation tools?

A

Automation tools are a tool used for content discovery.

17
Q

What are the main tool that you’ll be using for automated discovery?

A

ffuf, dirb, and gobuster