Statistics: Apps with summary/description by language at f-droid.org

I am currently implementing a Java library to load the F-Droid app catalog “index-v1.jar” into a local database.

The “index-v1.jar” downloaded in April contains 3798 apps. All have an English summary or description. :slight_smile:

Here is the number of apps with a summary/description, by language:

Example: de German 2985 means 2985 apps with summary/description in language de=German

CODE   NAME_ENGLISH                     APPS
-----  -------------------------------  ----
en     English                          3798
de     German                           2985
fr     French                           2809
ru     Russian                          2765
pt     Portuguese                       2738
zh-CN  Simplified Chinese               2718
pl     Polish                           2705
tr     Turkish                          2685
uk     Ukrainian                        2665
ro     Romanian                         2621
he     Hebrew                           2612
es     Spanish                          1934
it     Italian                          1454
nb     Norwegian Bokmål                 1363
zh-TW  Traditional Chinese / Taiwanese   951
eo     Esperanto                         918
ja     Japanese                          894
gd     Scottish Gaelic                   580
et     Estonian                          542
ar     Arabic                            541
sq     Albanian                          524
id     Indonesian                        472
nn     Norwegian Nynorsk                 389
is     Icelandic                         365
el     Greek                             350
vi     Vietnamese                        345
fi     Finnish                           341
bn     Bengali                           296
hu     Hungarian                         288
ko     Korean                            262
sv     Swedish                           251
nl     Dutch                             250
cs     Czech                             241
ca     Catalan                           209
lt     Lithuanian                        193
hi     Hindi                             188
sk     Slovak                            183
bg     Bulgarian                         181
da     Danish                            176
hr     Croatian                          174
th     Thai                              170
af     Afrikaans                         164
sl     Slovene                           163
lv     Latvian                           157
sw     Swahili                           130
eu     Basque                            106
am     Amharic                            91
fa     Persian                            85
sc     Sardinian                          61
no     Norwegian                          47
sr     Serbian                            46
cy     Welsh                              31
gl     Galician                           27
ta     Tamil                              23
be     Belarusian                         17
mk     Macedonian                         13
bo     Tibetan                            12
si     Sinhala / Sinhalese                11
az     Azerbaijani                        11
ml     Malayalam                          10
ur     Urdu                                9
zh-HK  Chinese (Hong Kong)                 9
mr     Marathi                             8
ms     Malay                               6
hy     Armenian                            4
my     Burmese                             4
uz     Uzbek                               4
ps     Pashto                              2
ka     Georgian                            1
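
A count like the one in the table above could be reproduced roughly as follows. This is only a sketch, not the Java library mentioned in the post. It assumes that `index` is the parsed content of index-v1.json, that each entry under `apps` may carry a `localized` map keyed by locale (such as `de` or `en-US`), and that texts stored directly on the app entry are English. The function name `count_apps_by_language` and the rule for folding regional variants are illustrative choices.

```python
from collections import Counter


def count_apps_by_language(index):
    """Count apps that have a summary or description, per language code."""
    counts = Counter()
    for app in index.get("apps", []):
        langs = set()
        # Assumption: texts stored directly on the app entry are English.
        if app.get("summary") or app.get("description"):
            langs.add("en")
        for locale, texts in app.get("localized", {}).items():
            if texts.get("summary") or texts.get("description"):
                # Assumption: fold regional variants ("de-DE" -> "de") but
                # keep the Chinese variants (zh-CN, zh-TW, zh-HK) separate.
                lang = locale if locale.startswith("zh") else locale.split("-")[0]
                langs.add(lang)
        # Using a set means each app counts at most once per language.
        counts.update(langs)
    return counts
```

After loading the file with `json.load`, `count_apps_by_language(index).most_common()` yields the languages ordered by app count. The exact numbers depend on how locales are folded, so they may not match the table.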

Here are the statistics for the number of apps, ordered by minSdk:

Example: 248 08 means 248 apps with minSdk=8 (Android-2.2)

CNT  MINSDK
---  ------
  1  ,,    
 10  01    
  9  02    
156  03    
154  04    
 43  05    
  5  06    
197  07    
248  08    
139  09    
102  10    
 69  11    
 11  12    
  4  13    
380  14    
321  15    
486  16    
133  17    
 52  18    
272  19    
 12  20    
570  21    
 50  22    
187  23    
 86  24    
 11  25    
 65  26    
  2  27    
 15  28    
  7  29    
  1  30    
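
The post does not say which release of an app the minSdk was taken from. The sketch below assumes that `index` is the parsed index-v1.json, that `packages` maps each app ID to a list of releases carrying `versionCode` and `minSdkVersion`, and that the release with the highest `versionCode` represents the app. The function name `count_apps_by_min_sdk` is made up for illustration.

```python
from collections import Counter


def count_apps_by_min_sdk(index):
    """Count apps per minSdk, using each app's newest release."""
    counts = Counter()
    for releases in index.get("packages", {}).values():
        if not releases:
            continue
        # Assumption: the highest versionCode is the representative release.
        latest = max(releases, key=lambda r: r.get("versionCode", 0))
        min_sdk = latest.get("minSdkVersion")
        # Zero-pad so that "08" sorts before "14", as in the table above.
        key = "unknown" if min_sdk is None else str(min_sdk).zfill(2)
        counts[key] += 1
    return counts
```

Printing `sorted(count_apps_by_min_sdk(index).items())` gives one line per minSdk value.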

Romaniab typo


fixed (in post and in my program)

Can you also give counts of how many use which repo host - github, gitlab, sourcehut,…?

Edit+:

Also

7 29
1 30

Which apps are using the most recent minSdk values?

That’s very easy. Just open the index-v1.json in an editor and search "sourceCode": "https://github.com.

This is what I got.

repo count
github 2986
gitlab 350
google code 68
codeberg 62
bitbucket 56
sourceforge 50
sourcehut 4
gitea 1

And some self-hosted repos.


Thanks @linsui ! Very interesting. And sad, but not too surprising, to see how many are still on Microsoft’s github and other proprietary, or non GPL systems.

To others who also may not know intuitively, “just open in an editor and search” means: first download https://f-droid.org/repo/index-v1.jar (mentioned in various places, including here: https://f-droid.org/en/2021/02/05/apis-for-all-the-things.html); then use an archiver program or the unzip command to extract index-v1.json, a 26+ MB file, from index-v1.jar; and finally have an editor and a computer that can search and display counts in a reasonable amount of time. This does not include Mousepad, less, vi or nano with default settings on an old ThinkPad. For me it would probably mean using some sed and grep, and patience. :slight_smile:
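
The download, extract and search steps described above can also be sketched in a few lines of Python, without an editor. The helper names `download_index` and `count_source_prefix` are made up for this example. The sketch relies on a jar being a zip archive and on the JSON being written as `"sourceCode": "…"` with one space after the colon, as in the search string quoted earlier in the thread.

```python
import urllib.request
import zipfile

INDEX_URL = "https://f-droid.org/repo/index-v1.jar"


def download_index(jar_path="index-v1.jar", url=INDEX_URL):
    """Download the signed index jar to a local file."""
    urllib.request.urlretrieve(url, jar_path)
    return jar_path


def count_source_prefix(jar_path, prefix):
    """Count sourceCode entries whose URL starts with the given prefix."""
    # Read index-v1.json straight out of the archive; no unzip step needed.
    with zipfile.ZipFile(jar_path) as jar:
        text = jar.read("index-v1.json").decode("utf-8")
    # Plain substring count, the same idea as searching in an editor.
    return text.count(f'"sourceCode": "{prefix}')
```

For example, `count_source_prefix(download_index(), "https://github.com")` counts the GitHub links. The whole file is held in memory, which should be acceptable for a file of roughly 26 MB.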

:rofl: Yes, the file is pretty big. I use VSCode/VSCodium, which uses ripgrep internally, and it gives me the result in 1 sec. You can use ripgrep directly:

rg --count-matches '"sourceCode": "https://github.com' index-v1.json
2986

You can get a similar result with GitHub’s code search (Search · SourceCode: https://github · GitHub). I don’t know why there is a difference, though.

The most interesting part is that there are still so many repos on Google Code, even though it has been closed.

Maybe we need to check if those moved elsewhere, if they had updates or most importantly, if they still work.


Fun with termux

$ time for r in "" github gitlab code.google codeberg bitbucket sourceforge sr.ht gitea ; do echo $r ; rg --count-matches "\"sourceCode\": \"https://$r" index-v1.json ; done

3767
github
2987
gitlab
361
code.google
68
codeberg
62
bitbucket
56
sourceforge
49
sr.ht
1
gitea
1

real 0m2.342s
user 0m1.457s
sys 0m0.537s

3,585 total

182 unaccounted for

Sad seeing +1 github already.
Backslashes to quote quotes are eaten by Discourse. :confused:

sr.ht should be git.sr.ht and hg.sr.ht.

Also observed a significant number (1%?) still listed as http links.

Sources of apps in the F-Droid main repo

time for r in "" github gitlab code.google archive.softwareheritage codeberg bitbucket sourceforge projectmaxs.org framagit.org notabug.org gitorious.org code.launchpad git.launchpad launchpad.net hg.sr.ht git.sr.ht git.taler git.savannah invent.kde source.puri.sm 0xacab.org android.googlesource git.mob-dev gitea ; do echo $r ; rg --count-matches "\"sourceCode\": \"https://$r" index-v1-x.json ; rg --count-matches "\"sourceCode\": \"http://$r" index-v1-x.json ; done

3766
37
github
2986
gitlab
361
code.google
68
archive.softwareheritage
38
codeberg
62
bitbucket
56
sourceforge
49
1
projectmaxs.org
22
framagit.org
18
notabug.org
11
gitorious.org
9
code.launchpad
8
git.launchpad
1
launchpad.net
1
hg.sr.ht
1
git.sr.ht
3
git.taler
3
git.savannah
3
invent.kde
5
source.puri.sm
5
0xacab.org
4
android.googlesource
3
git.mob-dev
2
gitea
1

real	0m1.056s
user	0m0.714s
sys	0m0.332s
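
Instead of listing every host prefix by hand, the hosts can be collected from the data itself. The sketch below assumes that `index` is the parsed index-v1.json and that each entry under `apps` may have a `sourceCode` URL. `count_source_hosts` is a made-up name. Note that it groups by exact hostname, so the numbers will differ from the prefix matches above (for instance, `https://gitlab` also matches self-hosted instances whose hostname starts with "gitlab").

```python
from collections import Counter
from urllib.parse import urlparse


def count_source_hosts(index):
    """Return (hosts, schemes) counters over all sourceCode URLs."""
    hosts = Counter()
    schemes = Counter()
    for app in index.get("apps", []):
        url = app.get("sourceCode")
        if not url:
            continue  # some apps have no sourceCode link
        parts = urlparse(url)
        hosts[parts.hostname or "unknown"] += 1
        schemes[parts.scheme] += 1  # "http" vs "https"
    return hosts, schemes
```

`hosts.most_common(25)` then gives a ranking comparable to the list above, and `schemes["http"]` gives the number of plain http links.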

list them!

Ask politely! Don’t demand!

Just do it!

Just because “certain people” are “rude” about something, doesn’t mean others can’t be less of a rude person…

"sourceCode": "http://subversion.ardour.org/svn/thirdparty/ardroid"
"sourceCode": "http://xwords.sourceforge.net/source.php"
"sourceCode": "http://dictionarymid.sourceforge.net/android.html"
"sourceCode": "http://code.webface.bg/dof-calculator.html"
"sourceCode": "http://bzr.ed.am/android/export-contacts"
"sourceCode": "http://bzr.ed.am/android/import-contacts"
"sourceCode": "http://code.seawolfsanctuary.com/keepingtracks"
"sourceCode": "http://sourceforge.jp/users/kodakana/pf/Lesser_Pad/scm"
"sourceCode": "http://projectmaxs.org/source"
... SNIP ...
"sourceCode": "http://projectmaxs.org/source"
"sourceCode": "http://www.jaqpot.net/netcounter"
"sourceCode": "http://git.zarb.org/?p=androidnetworktester"
"sourceCode": "http://git.tuxfamily.org/?p=scoretarot/git.git"
"sourceCode": "http://galexander.org/git/simplesshd.git/"
"sourceCode": "http://purl.org/net/smsfilter/source"
"sourceCode": "http://git.madore.org/cgit/android/UnicodeMap.git/"
"sourceCode": "http://svnandroid.wikidpad2.webfactional.com"

That’s not the list I’ve asked for…
( ͡° ͜ʖ ͡°)

Note that the sourceCode link is not the repo link, i.e. not the one the app is built from.

I actually wanted to know which appID, the name of the .yml file :wink:
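
A list of app IDs rather than bare links could be produced along these lines. The sketch assumes that `index` is the parsed index-v1.json and that each entry under `apps` has a `packageName` field holding the app ID, which is also assumed to be the name of the metadata .yml file. `apps_with_plain_http_source` is a hypothetical helper name.

```python
def apps_with_plain_http_source(index):
    """Return sorted (appID, URL) pairs whose sourceCode link is plain http."""
    return sorted(
        (app["packageName"], app["sourceCode"])
        for app in index.get("apps", [])
        if "packageName" in app
        and (app.get("sourceCode") or "").startswith("http://")
    )
```

Looping over the result and printing `f"{app_id}.yml  {url}"` gives one line per affected metadata file.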