KX Interview Questions 03/11/22 Flashcards

Question

Name two other namespaces?

Answer 1

Reading/Memory mapping a variable or kdb file. Returns its value. Often used to map columns of data from a splayed table into memory.

Answer 2

Assign a value to a global variable or save an object as a file/directory. To splay a table: "dir set t/set[dir;t].

Answer 3

Load binary data from a file/directory. Loads the file but does not return the contents until called on. rload is used to load a splayed table from a directory.

Answer 4

Save to a binary file.

Answer 5

File text operator. 5 forms: 1. Prepare text (Table to delimited strings) (delimiter/csv 0: t 2. Save Text (Save strings) (file symbol 0: strings) 3. Load csv (Load delimited strings) 4. load fixed (load other format lists/strings/matrix) 5. Key Value Pairs delimited string as key value pairs

Answer 6

Used to read text from a file or process handle. read0 (`:foo;6;5) "world"

Answer 7

File binary operator. Used to read and parse or write bytes.

Answer 8

Read bytes from a file.

Answer 9

free command. free -h

Answer 10

du = disk usage and df = disk space available

Answer 11

\\ to exit q program

Answer 12

My undergraduate degree was psychology and computing. It combined psychological research, general computing, and design modules to cater for the growing importance of human computer interaction research and development of psychologically rewarding and not damaging technologies, as the world gets more technology driven with new technologies like AI and Virtual reality. I studied skills such as programming, data mining, UX design, UI design, Interaction Design, Usability design and User research alongside research modules such as research methods and statistical analysis for research. All of our class members gained accreditation by the Psychological Society of Ireland for our research.

Answer 13

My post graduate degree was in business information and analytical systems. It was a highly technical business analytics degree. We had project management modules, business analytics modules and highly technical data analytics and programming modules. We used R, Lindo and Python for these modules. We had a year-long project within the program where we created a product demand forecasting system in python that used open source met Eireann weather forecast data to predict product demand of weather sensitive products such as ice cream, barbeque food, cold and hot drinks etc. We created a full business plan with employment , marketing and finance need for the deployment of our product.

Answer 14

The technical modules such as 3 years of programming with python, data mining, relational databases. I'm sure the design modules will also be helpful in the future.

Answer 15

Programming for data analytics with Python, prescriptive analytics with LINDO and descriptive and predictive analytics with R. We also had an AWS cloud technology module which I will likely use in the future alongside some of the business analytics and project management modules.

Answer 16

I enjoy playing soccer and going to the gym in my free time. I frequently travel as well whenever I get the chance.

Answer 17

Python, R, LINDO, KDB now, and some SQL.

Answer 18

We had a year-long project within the program where we created a product demand forecasting system in python that used open source met Eireann weather forecast data to predict product demand of weather sensitive products such as ice cream, barbeque food, cold and hot drinks etc. We created a full business plan with employment , marketing and finance need for the deployment of our product.

Answer 19

Business data strategy and management, IS project planning and oversight, descriptive and predictive analytics, prescriptive analytics, programming for data analysis, cloud technologies, secure data acquisition and management. SQL Relational databases. Python programming. Data Mining with pandas.

Answer 20

Python good. KDB nearly as good but still improving. SQL okay but needs recapping as I haven't done my SQL module since my first year of college.

Answer 21

What is the role in detail, what kind of projects will I be working on and how big is the team?

Answer 22

-7h is an atom long. 7h is a vector of longs.

Answer 23

Simple (one file saved as delimited text) Splayed (a table folder where each column is its own subfolder) Partitioned (partitioned on time each partitions have a splayed table within) Segmented (different partitions in different storage locations)

Answer 24

Contains all of the unique symbols in a table and their enumerations so that the table can be mapped back into memory as it was before it was enumerated.

Answer 25

A splayed table is a table folder that contains files for each column. A partitioned table is a table folder that contains sub folders of time values within each time value is a splayed table of all data that occurred within that time interval

Answer 26

Time series databases often get too long to have saved in one storage location. Segmenting a table saves large intervals of the tables to different storage locations. values from multiple storage locations can be accessed at once through parallel computing.

Answer 27

par.txt contains the directories of each partition in a segmented database.

Answer 28

m: (1 2 3; `a`b`c)

Answer 29

f: {[parameters] logic}

Answer 30

When a function is passed with a less than defined arguments. It can be used to keep one or more arguments constant.

Answer 31

Data feed is the source of data usually formatted in a compiled language that is sent to the feed handler which converts the data to Q column-oriented rows. These rows are then sent to the Ticker plant which logs the message, converts the rows into Q tables with time and sym as their first two columns and then sends these tables to the RDB where todays data can be accessed and queried in memory. At the end of day this data is sent to the HDB and discarded from the RDB. The HDB holds the data loads the data from on disk partitioned tables which can be accessed but not modified by the client via the gateway.

Answer 32

?[t;c;b;a] t is for table name; c is for a list of conditions (where) b is a dictionary of grouping constraints (the by condition) and a is the list of aggregates (what is selected)

Answer 33

Passing column names as arguments created by the outputs of parse.

Answer 34

opening a port = q -p 5000 in the command line or \p 5000 in the q session. listening to that port in another process is done.

Answer 35

Synchronous is used when you want your connecting process to get a return/output from an action taken on another process getting tables/variables. Asynchronous neg is used when you don't want an output such as assigning variables or sending tables.

Answer 36

The feedhandler uses an asynchronous connection to the tickerplant as the feedhandler does not need to receive or show any data from the tickerplant. The RDB connects to the tickerplant so that it can produce output of the tables it has received etc.

Answer 37

Grouped (no structure) Unique (no repeated items) Parted (contiguous items that aren't ordered) and Sorted (Ascending items)

Answer 38

parallel computing taking place on different cores and threads for increased performance.

Answer 39

shows the user ID as a symbol.

KX Interview Questions 03/11/22 Flashcards

(70 cards)