random Flashcards

Question

Python: To iteratively replace a list of characters with spaces, type

Answer 1

for item in [".", "?", "!"]: | text = text.replace(item, " ")

Answer 2

soup_page = BeautifulSoup(page, "html.parser") for script in soup_page.find_all(["script", "style"]): script.extract() text = soup.get_text() for item in [".", "?", "!", ",", " "]: text = text.replace(item, " ")

Answer 3

df = df.append({"column1":"value", "column2":"value", "column3":"value"}, ignore_index=True)

Answer 4

2x2 matrix with the y index of actual class and x index of predicted class, that counts how many values of each class were correctly or incorrectly predicted. It is a measure of how many false positives and false negatives there are.

Answer 5

Not use select percentile use a stemmer remove irrelevant symbols and characters use Tfidf instead of CountVect

Answer 6

not have mislabeled data

Answer 7

the all features you think have high information gain, and do whatever is necessary to get them into the dataset.

Answer 8

to outperform other models, and if it does it may be because the data has duplicates

Answer 9

os.system("start /file/path.csv")

Answer 10

*if using DataFrameMapper* | sklearn_pandas.GridSearchCV(pipeline, param_grid=param_grid, verbose=3, scoring="accuracy", cv=10)

Answer 11

impute the data based on hints in the other columns. eg Mr. is associated with older age.

Answer 12

import sklearn_pandas param_grid = {"setname__parameter":[10, 20, 30]} grid_model = sklearn_pandas.GridSearchCV(pipeline, param_grid=param_grid, verbose=3, scoring="accuracy", cv=10)

Answer 13

have any custom transformer (I think)

Answer 14

That it is a variable name that must be assigned in one of the custom parameters.

Answer 15

a nomal CSV, not the Excel type

Answer 16

df = df.reindex_axis(["Conversions"] + [item for item in df.columns if item !="Conversions"], axis=1)

Answer 17

class Myclass(Parentclass1, Parentclass2):

Answer 18

verbose=3 | sklearn_pandas.GridSearchCV(pipe, param_grid, verbose=3)

Answer 19

cv =10 | sklearn_pandas.GridSearchCV(pipe, param_grid, verbose=3, scoring="accuracy", cv=10)

Answer 20

class Myclass: def __init__(self, **args): def get_attribute(self): my_attribute = input("Attribute query?") if my_attribute != "chosen attribute": return get_attribute() else: get_attribute() self.my_attribute = self.get_attribute()

Answer 21

my_list[2] += 1 or my_list[2] = my_list[2] +1

Answer 22

grouping common operations into functions and common functionality in classes.

Answer 23

from otherclassfile import Myotherclass class Myclass(Myotherclass):

Answer 24

class Myclass(Otherclass, Otherclass2)

Answer 25

create a new function in the current class with the same name

Answer 26

import it.

Answer 27

in def __init__(self, **args):

Answer 28

the Object class

Answer 29

removes any non unique values and order the rest in ascending order.

Answer 30

it's own brackets

Answer 31

pandas. set_option("display.max_rows", 1000) | pandas. set_option("display.max_columns", 1000)

Answer 32

an external file that must be imported into the file you are running.

Answer 33

sum(my_list)

Answer 34

n_estimators, max_features, max_depth, min_samples_leaf

Answer 35

the task is called regression.

Answer 36

main top left to bottom right has the highest numbers because that signifies correct classifications.

Answer 37

``` the rate of how often the algorithm misclassifies a sample that is in fact a certain class as another one. "This class only gets classified correctly x percent of the time." When a sample is in fact a certain class, how often is it classified correctly. ``` "in fact" Measure of false negatives for a class. true positives/(false negatives + true positives)

Answer 38

the rate of how often when a classification is made, how often it is correct. "When a classification is finally made for this sample we are x sure that is was made correctly" How often do samples of other classes get mistaken for this class. "when classification is made" Measure of true positives for a class. true positives/(false positives + true positives)

Answer 39

precision, recall, or f1 score (which is both) on a class by class basis.

Answer 40

precision score and recall score

Answer 41

scoring="f1"

Answer 42

binned the times into sections of the day and used a NearestNeighbors classifier to predict based on locality.

Answer 43

should not have any bearing on the label so any features with an information gain lower than the spurious attribute can be ignored.

Answer 44

overfitting

Answer 45

The population is grouped by a characteristic, and then a number of samples is pulled from each group to represent it.

Answer 46

you group samples based on a characteristic but then only pull samples from one of the groups.

Answer 47

All of the samples are grouped together and chosen chosen at random and then returned back into the pool at each draw.

Answer 48

A text editor and the command is: vi my_file.py

Answer 49

my_list.sort()

Answer 50

my_list.reverse()

Answer 51

my_list.count("value")

Answer 52

list(map(lambda x: x*2, my_list))

Answer 53

an object, not a list.

Answer 54

os.listdir("/users/student/desktop")

Answer 55

a list comprehension assigned to a variable would do.

Answer 56

make the labels binary (1 and 0) only.

Answer 57

n_jobs=-1 to the parameters

Answer 58

best_parameters = grid_search.best_estimator_.get_params()

Answer 59

re-instantiate the instance afterwards so it can take on the new attributes.

Answer 60

case sensitive

Answer 61

df["Prediction"] = pandas.Series(model_grid.predict(my_transformer(df_features)))

Answer 62

my_browser.current_url

Answer 63

model_grid.get_params_ | model_grid.best_score_

Answer 64

model_grid.get_params_

Answer 65

my_array = np.array([1,2], [3,4]) my_array.T array([ [1, 3], [2, 4] ])

Answer 66

numpy.concatenate((a, b), axis=1)

Answer 67

usually incentivized traffic for a short time.

Answer 68

a sign up or download in exchange for a bribe like in game currency.

Answer 69

HyperText Transfer Protocol

Answer 70

text with links in it

Answer 71

rules for getting data from one place to another

Answer 72

Representational State Transfer

Answer 73

all information necessary to respond to a request is available in each individual request; no data, or state, is held by the server from request to request

Answer 74

my_numpy_array.dtype

Answer 75

DROP TABLES tablename;

Answer 76

DROP TABLES tablename1, tablename2;

Answer 77

INSERT INTO tablename VALUES ("String 1", "String 2"), ("String 1", "String 2");

Answer 78

surrounded by quotes

Answer 79

import re | re.match(r'^org/?P\w+/$', 'org/companyA')

Answer 80

import re file = open("my_file.txt", encoding="utf-8") data = file.read() file.close()

Answer 81

if (True and True) and (False or True) or (False and False):

Answer 82

%pylab inline

Answer 83

rename, not replace

Answer 84

df.to_string(index=False)

Answer 85

import render_template ``` @app.route("/") def my_view_function(): return render_template("file.html") ```

Answer 86

``` put {{ var_name }} in the template pass the variable into render_template like return render_template("file.html", var_name=var_name) ```

Answer 87

set defaults for the variable that are supposed to be passed in.

Answer 88

templates to pull a variable into it from the view.

Answer 89

the template directory

Answer 90

variable: {{ var_name }} block: {%block my_block %}{% endblock %}

Answer 91

have quotes around it

Answer 92

{% extends "layout.html" %} {% block title %}{{ super() }} My Title Tag{% endblock %} {% block body_content %}

This is the content of my body

{% endblock %}

Answer 93

from flask import redirect from flask import url_for ``` @app.rout("/save") def save(): return redirect(url_for("view_function")) ```

Answer 94

@app.route("/save", methods=["POST"])

Answer 95

request.form

Answer 96

import json import make_response import redirect import url_for @app.route("/save", methods=["POST"]) def save_view(): response = make_response(redirect(url_for("index.html"))) response.set_cookie("cookie_name", json.dumps(dict(request.form.items()))) return response

Answer 97

{% block my_body %} - form action="{{ url_for("save") }}" method="POST"> - label>Form title-/label> - input type="text" name="name" value="" autofocus> - input type="submit" value="default!"> - /form> {% endblock%}

Answer 98

import request in the app file from flask import request

Answer 99

the response to the browser

Answer 100

a dict with the key as the name from the name parameter in the form, and the value as the value inputted into the form.

Answer 101

a name (which you give) and a value which is a dict with the name and value from the form field.

Answer 102

``` def get_cookies(): try: cookie = json.loads(request.cookies.get("cookie1")) except: cookie = {} return data ``` @app.route("/save", methods=["POST"]) def save(): response = make_response(redirect(url_for("index"))) cookie = get_cookies() cookie.update(dict(request.form.items())) response.set_cookie("cookie1", json.dumps(cookie)) return response

Answer 103

``` create the function that returns the cookie in dict format. def get_saved_data(): try: data = json.loads(request.cookies.get("cookie name")) except: data = {} return data ``` ``` Pass the cookie dict into the template. @app.route("/") def index(): data = get_saved_data() return render_template("index.html", data=data) ``` Set the value in the form.

Answer 104

request.form.items()

Answer 105

{% for item in my_list %} -li>-h2>item-/h2>-/li> {% endfor %}

Answer 106

most import parent class last.

Answer 107

app.py, templates, static

Answer 108

-link rel="stylesheet" href="../static/styles.css">

Answer 109

- form action="" method="" enctype=multipart/form-data> - input type="file" value="value" name="name"> - /form>

Answer 110

I am already in my static directory, so I can reference the files directly without changing levels.

Answer 111

ALLOWED_EXTENSIONS = set(['txt', 'pdf', 'png', 'jpg', 'jpeg', 'gif']) ``` app = Flask(__name__) app.config['UPLOAD_FOLDER'] = '/home/alpalalpal/mysite/static' ``` @app.route('/4', methods=['GET', 'POST']) def upload_file(): if request.method == 'POST': file = request.files['file'] if file and allowed_file(file.filename): filename = secure_filename(file.filename) file.save(os.path.join(app.config['UPLOAD_FOLDER'], filename)) return redirect(url_for('uploaded_file', filename=filename)) return ''' Upload new File

Upload new File

'''

Answer 112

pyautogui.scroll(-10)

Answer 113

an open-source software framework written in Java for distributed storage and distributed processing of huge data sets.

Answer 114

do type checking, which is verifying and enforcing the constraints of types at compile-time as opposed to run-time.

Answer 115

an algorithm that allows you to query data in parallel on a distributed cluster of computers.

Answer 116

a terabyte of data

Answer 117

volume, variety, veracity and velocity

Answer 118

library of scalable machine-learning algorithms, implemented on Apache Hadoop

Answer 119

Hadoop Distributed File System

Answer 120

one heavy duty computers and then 10-15 commodity computers

Answer 121

single point in a network or single computer in a cluster.

Answer 122

import os | os.chdir("C:\\folder")

Answer 123

restarting he kernel

Answer 124

the site that placed them.

Answer 125

``` def log(func): def inner(): print("string") return func() return inner ``` ``` @log def say_hello(): return "Hello there!" ``` say_hello()

random Flashcards

(162 cards)

This is the content of my body

Upload new File