Commit 9d6e95e0 authored Sep 30, 2021 by fan.jiang
com.taobao.taobao_iqiyi_3
parent beb0126b
Showing 2 changed files with 18 additions and 2 deletions
...ge_to_other_business/com_eg_android_AlipayGphone_reyun.sh +7 -2
...sta/dmp/datasource/dm/ComEgAndroidAlipayGphoneReyun.scala +11 -0
azkaban/dm/pseudo_package_to_other_business/com_eg_android_AlipayGphone_reyun.sh
@@ -8,6 +8,7 @@ source ../../dmp_env.sh
 dt_today=$(date -d "$ScheduleTime 1 days ago" +"%Y%m%d")
 dt_slash_today=$(date -d "$ScheduleTime 1 days ago" +"%Y/%m/%d")
 update=$(date -d "$ScheduleTime 15 days ago" +"%Y-%m-%d")
+update02=$(date -d "$ScheduleTime 3 days ago" +"%Y-%m-%d")
 check_await "${TMP_EGGPLANTS_OUTPUT_PATH}/${dt_slash_today}"
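The hunk above derives a second look-back date, `update02`, three days before the schedule time. As a minimal sketch of that pattern, assuming GNU `date` and assuming `ScheduleTime` holds a value that `date -d` can parse (the sample value below is illustrative, and `days_ago` is a hypothetical helper that is not part of the commit):

```shell
# Hypothetical helper (not in the commit): print the date N days before a
# base date, in the given strftime format. Relies on GNU date's -d option.
days_ago() {
  base=$1   # base date, e.g. the value of $ScheduleTime
  n=$2      # number of days to go back
  fmt=$3    # output format, e.g. %Y-%m-%d
  date -d "$base $n days ago" +"$fmt"
}

# Illustrative schedule time; the real value is supplied by the scheduler.
ScheduleTime="2021-09-30"
update=$(days_ago "$ScheduleTime" 15 "%Y-%m-%d")    # 15-day window
update02=$(days_ago "$ScheduleTime" 3 "%Y-%m-%d")   # 3-day window
echo "update=$update update02=$update02"
```

With the sample value, the 15-day window starts on 2021-09-15 and the 3-day window on 2021-09-27.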
@@ -36,8 +37,8 @@ spark-submit --class mobvista.dmp.datasource.dm.ComEgAndroidAlipayGphoneReyun \
  --conf spark.sql.shuffle.partitions=3000 \
  --conf spark.driver.maxResultSize=4g \
  --conf spark.network.timeout=720s \
- --master yarn --deploy-mode cluster --executor-memory 8g --driver-memory 6g --executor-cores 5 --num-executors 10 \
- ../../${JAR} -dt_today ${dt_today} -output1 ${OUTPUT_PATH1} -output2 ${OUTPUT_PATH2} -update ${update} \
+ --master yarn --deploy-mode cluster --executor-memory 8g --driver-memory 6g --executor-cores 5 --num-executors 20 \
+ ../../${JAR} -dt_today ${dt_today} -output1 ${OUTPUT_PATH1} -output2 ${OUTPUT_PATH2} -update ${update} -update02 ${update02} \
  -coalesce 200
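The job now takes one more argument, `-update02`, which the Scala side declares as "[must]". An empty shell variable would silently shift the remaining arguments, so a guard before `spark-submit` may be worth considering. The following is only a sketch: `require_args` is a hypothetical helper that is not part of the commit, and the sample values are illustrative.

```shell
# Hypothetical guard (not in the commit): each argument is NAME=VALUE.
# Returns non-zero and names the first entry whose VALUE is empty.
require_args() {
  for pair in "$@"; do
    name=${pair%%=*}    # text before the first '='
    value=${pair#*=}    # text after the first '='
    if [ -z "$value" ]; then
      echo "missing required argument: $name" >&2
      return 1
    fi
  done
  return 0
}

# Illustrative values; in the real script these come from the date commands.
update="2021-09-15"
update02="2021-09-27"
if require_args "update=$update" "update02=$update02"; then
  echo "arguments ok"
fi
```

Placed just before the `spark-submit` call, a failing check could stop the script instead of launching a job with misaligned arguments.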
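@@ -57,4 +58,7 @@ com.ss.android.ugc.aweme_reyun
 com.taobao.litetao_btop
 com.ss.android.ugc.aweme_btop
 Loaded daily into the business=other partition of dmp
+Update:
+Load devices from the past 3 days that carry the iQIYI Taobao install package name, i.e. com.taobao.taobao_iqiyi,
+restricting the pseudo package name to devices active in the past three days. The audience package can be named com.taobao.taobao_iqiyi_3
 '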
\ No newline at end of file
src/main/scala/mobvista/dmp/datasource/dm/ComEgAndroidAlipayGphoneReyun.scala
@@ -29,6 +29,7 @@ class ComEgAndroidAlipayGphoneReyun extends CommonSparkJob with Serializable {
     options.addOption("output2", true, "[must] output2")
     options.addOption("dt_today", true, "[must] dt_today")
     options.addOption("update", true, "[must] update")
+    options.addOption("update02", true, "[must] update02")
     options
   }
@@ -44,6 +45,7 @@ class ComEgAndroidAlipayGphoneReyun extends CommonSparkJob with Serializable {
     val output2 = commandLine.getOptionValue("output2")
     val dt_today = commandLine.getOptionValue("dt_today")
     val update = commandLine.getOptionValue("update")
+    val update02 = commandLine.getOptionValue("update02")
     val spark = SparkSession.builder()
       .appName("ComEgAndroidAlipayGphoneReyun")
@@ -88,6 +90,15 @@ class ComEgAndroidAlipayGphoneReyun extends CommonSparkJob with Serializable {
 | and package_name in ('com.taobao.litetao','com.ss.android.ugc.aweme')
 | and device_type in ('imei','gaid','oaid','idfa','imeimd5','gaidmd5','oaidmd5','idfamd5')
 | and update_date >= "${update}"
+|union
+|select
+| device_id, device_type ,concat("[\\"",package_name,"_3","\\"]") as package_name
+|from
+| dwh.dm_install_list_v2
+|where dt='${dt_today}' and business in ('dsp_req')
+| and package_name in ('com.taobao.taobao_iqiyi')
+| and device_type in ('imei','gaid','oaid','idfa','imeimd5','gaidmd5','oaidmd5','idfamd5')
+| and update_date >= "${update02}"
 """.stripMargin
 val df01: DataFrame = spark.sql(sql1).persist(StorageLevel.MEMORY_AND_DISK_SER)
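The added UNION branch appears to do two things: keep only `com.taobao.taobao_iqiyi` rows whose `update_date` is on or after `update02`, and rewrite the package name into a JSON-style one-element array with a `_3` suffix. The sketch below emulates that logic on tab-separated sample rows with `awk`, purely for illustration. `pseudo_package_rows` and the sample data are hypothetical, and the `dt`, `business`, and `device_type` filters are left out for brevity.

```shell
# Hypothetical emulation (not in the commit) of the new UNION branch.
# Reads tab-separated rows from stdin:
#   device_id, device_type, package_name, update_date
pseudo_package_rows() {
  cutoff=$1   # earliest update_date to keep, formatted YYYY-MM-DD
  awk -F '\t' -v cutoff="$cutoff" '
    # ISO dates sort lexicographically; appending "" forces string comparison.
    $3 == "com.taobao.taobao_iqiyi" && ($4 "") >= (cutoff "") {
      printf "%s\t%s\t[\"%s_3\"]\n", $1, $2, $3
    }'
}

# Sample rows: d1 was updated inside the 3-day window, d2 was not.
printf 'd1\timei\tcom.taobao.taobao_iqiyi\t2021-09-28\nd2\toaid\tcom.taobao.taobao_iqiyi\t2021-09-20\n' |
  pseudo_package_rows 2021-09-27
```

Only the `d1` row survives, with its package name rendered as `["com.taobao.taobao_iqiyi_3"]`.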