🃏3 · PDE Flashcards
Flashcards help to learn and memorise some information.
Q: Can I use cached results for queries against multiple tables using a wildcard?
A: No, cached results are not supported for queries against multiple tables using a wildcard even if the Use Cached Results option is checked
https://cloud.google.com/bigquery/docs/querying-wildcard-tables
Q: How many Gigabytes should your BigTable table have to be a good test case for performance workloads?
A: Use at least 100 GB for your workload tests
Q: What is the keyword in BigQuery standard SQL used when selecting multiple tables with wildcard by their suffices?
A: _TABLE_SUFFIX
https://cloud.google.com/bigquery/docs/querying-wildcard-tables
Q: Why is it not recommended to use the NOW() function in BigQuery queries?
A: Using the NOW() function can significantly impact query performance and cost, due to the overhead of retrieving the current timestamp from the server.
https://cloud.google.com/bigquery/docs/reference/standard-sql/date_functions
Q: What are the two main steps involved in copying data between tables in BigQuery using a SELECT statement?
A: Create a new table schema and insert data from existing dataset to the new one using INSERT SELECT
statement
As of 21 December 2023
the most convenient way to clone BigQuery tables is by using CLONE instead SELECT
https://cloud.google.com/bigquery/docs/table-clones-create
CREATE TABLE
myproject.myDataset_backup.myTableClone
CLONE myproject.myDataset.myTable;
Q: What is the maximum amount of data that a BigQuery JavaScript UDF can output when processing a single row?
R: Approximately 5 MB
Q: Is possible store data by using the preemtible node works build-in storage?
A: Preemptible worker nodes do not have persistent disks. This means that you cannot store data on the worker nodes themselves. Instead, you will need to store your data in Cloud Storage and use the HDFS connector to access it from your worker nodes.
https://cloud.google.com/dataproc/docs/concepts/compute/secondary-vms
Q: You can create a Dataproc cluster with internal IP addresses only by using which flag?
A: The gcloud dataproc clusters create
command with the ‑‑no-address
flag.
Q: Which action would you undertake when you need to encrypt data at rest with encryption keys that you can create, rotate and destroy as needed?
A: Create encryption keys in Cloud Key Management Service (KMS). Use those keys to encrypt your data as needed.
Q: Which actions are able to run a BigQuery Job?
A: Jobs run for all BigQuery actions including loading, exporting, querying or copying data
Last updated