王金锋 / mobvista-dmp
Commit 48220007 authored Aug 23, 2021 by fan.jiang
Temporarily output audience-package data from the dsp_req partitions to S3 for product use
Parent: 3fe9507c
Showing 3 changed files with 35 additions and 0 deletions

azkaban/dsp/tmp_extract_data_from_dsp_req.job (+3 -0)
azkaban/dsp/tmp_extract_data_from_dsp_req.sh (+32 -0)
...obvista/dmp/datasource/dsp/TmpExtractDataFromDspReq.scala (+0 -0)

azkaban/dsp/tmp_extract_data_from_dsp_req.job
0 → 100644
type=command
command=sh -x ./tmp_extract_data_from_dsp_req.sh
\ No newline at end of file
azkaban/dsp/tmp_extract_data_from_dsp_req.sh
0 → 100644
#!/bin/bash

source ../dmp_env.sh

ScheduleTime=${ScheduleTime:-$1}
LOG_TIME=$(date -d "$ScheduleTime 1 days ago" "+%Y-%m-%d")
dt=$(date -d "$ScheduleTime 1 days ago" "+%Y%m%d")
date_path=$(date -d "$ScheduleTime 1 days ago" "+%Y/%m/%d")
old_path=$(date -d "$ScheduleTime 2 days ago" "+%Y/%m/%d")
rm_dt=$(date -d "$ScheduleTime 180 days ago" "+%Y%m%d")
rm_dt_path=$(date -d "$ScheduleTime 180 days ago" "+%Y/%m/%d")

Tmp_Extract_Data_From_DspReq_Path="s3://mob-emr-test/dataplatform/DataWareHouse/data/dwh/tmp/rtdmp_tmp_extract_data_from_dspReq_path"

ETL_DSP_REQ_ETL_HOURS_INPUT_PATH="${ETL_DSP_REQ_ETL_HOURS}/$date_path/*/*"

check_await "${ETL_DSP_REQ_ETL_HOURS}/$date_path/23/_SUCCESS"

hadoop fs -rm -r ${Tmp_Extract_Data_From_DspReq_Path}

spark-submit --class mobvista.dmp.datasource.dsp.TmpExtractDataFromDspReq \
  --conf spark.yarn.executor.memoryOverhead=3072 \
  --conf spark.sql.shuffle.partitions=10000 \
  --files ${HIVE_SITE_PATH} \
  --master yarn --deploy-mode cluster --executor-memory 10g --driver-memory 6g --executor-cores 4 --num-executors 100 \
  ../${JAR} -input $ETL_DSP_REQ_ETL_HOURS_INPUT_PATH \
  -output ${Tmp_Extract_Data_From_DspReq_Path} \
  -coalesce 200 || exit 1
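
To make the date handling in tmp_extract_data_from_dsp_req.sh easier to follow, here is a minimal sketch that reruns three of its `date` expressions in isolation. The schedule time `2021-08-23` is an assumed sample value (in the real job it comes from the scheduler or from `$1`), and the sketch assumes GNU coreutils `date`, since `-d` with a relative phrase such as "1 days ago" is a GNU extension.

```shell
#!/bin/bash
# Assumed sample input; the real job takes this from the environment or from $1.
ScheduleTime="2021-08-23"

# Same expressions as in tmp_extract_data_from_dsp_req.sh (requires GNU date).
dt=$(date -d "$ScheduleTime 1 days ago" "+%Y%m%d")
date_path=$(date -d "$ScheduleTime 1 days ago" "+%Y/%m/%d")
old_path=$(date -d "$ScheduleTime 2 days ago" "+%Y/%m/%d")

# Print the resolved values so they can be compared with the S3 partition layout.
echo "dt=$dt"
echo "date_path=$date_path"
echo "old_path=$old_path"
```

With this sample input, `date_path` resolves to the day before the schedule time, so the job reads the previous day's hourly partitions and waits for that day's `23/_SUCCESS` marker before it starts.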
src/main/scala/mobvista/dmp/datasource/dsp/TmpExtractDataFromDspReq.scala
0 → 100644
This diff is collapsed.