I have to calculate % contribution for each category. select portfolio,Portfolio_views,Portfolio_views/total_views*100 as perc_contribution from ( select categ
how can to see the correlations regarding the target column of the trained models? Is there an option in the Redshift query editor oder Sagemaker? Cheers
I would like to generate a sequence of dates in Redshift. I can easily do it in PostgreSQL using the generate_series() function. This function works in Redshift
Sorry in advance for this dumb question. I am just begining with AWS and Pyspark. I was reviewing pyspark library and I see pyspark need a tempdir in S3 to be a
Having a problem launching PySpark job that utilizes connection to RedShift via awswrangler lib. Everything works fine if using --additional-python-modules: aws
I've parquet files and need to load into redshift using copy command. The command is getting failed due to spectrum scan error. So I want to ignore the file if
Is it possible to create & publish redshift table directly to Tableau server as a datasource using python? It'll be basically select * from table. I looked
I would like to make the first letter of each word capitalized. However, if the word next to the first letter is already capitalized, then it should not be affe
I have ORC files that have columns with double datatype in the file, these columns are queryable in AWS Athena as numeric(18,0). This is the best I could find o
I'm trying to unload redshift data to S3, but it's unloading in CSV format. How can unload the Redshift table to S3 bucket in parquet format using Java?
I have an interesting problem to find the root cause. Inner join 2 tables on 2 columns, but the results are not expected. The data types for both columns are th
I am trying to get start date of the week from existing daily date field from the same table. For example daily dates from 05/08/2022 to 05/14/2022 , the start
This is working: SELECT * FROM table WHERE name LIKE '%ab_cd%' How to include '%ef_gh%' in a single/multiple statement? My try: SELECT * FROM table WHERE name
I am having some data in aurora mysql db, I would like to do two things: HISTORICAL DATA: To read the data from aurora(say TABLE A) do some processing and updat
I have created a external table in redshift spectrum.Upon running the select * from table_name, i am getting following error SQL Error [XX000]: ERROR: Spectrum
I've been trying to filter the data for the last X number of days. All these columns work as standalone results when I remove the time filter from the where cla
I have a folder containing files in parquet format. I used crawler to create table defined in Glue Data Catalog which counted to 2500+ columns. I want to create
I am ingesting (COPY) json data from S3. Initially storing in a single SUPER column, and then extracting out the individual attributes into a normalised table/c
Suppose I have 2 tables: Table A C_ID P_ID 1 1 1 2 2 1 Table B P_ID 1 2 In Table A, C_ID and P_ID serve as PK, in Table B P_ID
I have a Redshift datawarehouse that is pulling data in from multiple sources. One is my from MySQL and the others are some cloud based databases that get pulle