Git fork
at reftables-rust 982 lines 35 kB view raw
1Like other projects, we also have some guidelines for our code. For 2Git in general, a few rough rules are: 3 4 - Most importantly, we never say "It's in POSIX; we'll happily 5 ignore your needs should your system not conform to it." 6 We live in the real world. 7 8 - However, we often say "Let's stay away from that construct, 9 it's not even in POSIX". 10 11 - In spite of the above two rules, we sometimes say "Although 12 this is not in POSIX, it (is so convenient | makes the code 13 much more readable | has other good characteristics) and 14 practically all the platforms we care about support it, so 15 let's use it". 16 17 Again, we live in the real world, and it is sometimes a 18 judgement call, the decision based more on real world 19 constraints people face than what the paper standard says. 20 21 - Fixing style violations while working on a real change as a 22 preparatory clean-up step is good, but otherwise avoid useless code 23 churn for the sake of conforming to the style. 24 25 "Once it _is_ in the tree, it's not really worth the patch noise to 26 go and fix it up." 27 Cf. https://lore.kernel.org/all/20100126160632.3bdbe172.akpm@linux-foundation.org/ 28 29 - Log messages to explain your changes are as important as the 30 changes themselves. Clearly written code and in-code comments 31 explain how the code works and what is assumed from the surrounding 32 context. The log messages explain what the changes wanted to 33 achieve and why the changes were necessary (more on this in the 34 accompanying SubmittingPatches document). 35 36Make your code readable and sensible, and don't try to be clever. 37 38As for more concrete guidelines, just imitate the existing code 39(this is a good guideline, no matter which project you are 40contributing to). It is always preferable to match the _local_ 41convention. New code added to Git suite is expected to match 42the overall style of existing code. Modifications to existing 43code are expected to match the style the surrounding code already 44uses (even if it doesn't match the overall style of existing code). 45 46But if you must have a list of rules, here are some language 47specific ones. Note that Documentation/ToolsForGit.adoc document 48has a collection of tips to help you use some external tools 49to conform to these guidelines. 50 51For shell scripts specifically (not exhaustive): 52 53 - We use tabs for indentation. 54 55 - Case arms are indented at the same depth as case and esac lines, 56 like this: 57 58 case "$variable" in 59 pattern1) 60 do this 61 ;; 62 pattern2) 63 do that 64 ;; 65 esac 66 67 - Redirection operators should be written with space before, but no 68 space after them. In other words, write 'echo test >"$file"' 69 instead of 'echo test> $file' or 'echo test > $file'. Note that 70 even though it is not required by POSIX to double-quote the 71 redirection target in a variable (as shown above), our code does so 72 because some versions of bash issue a warning without the quotes. 73 74 (incorrect) 75 cat hello > world < universe 76 echo hello >$world 77 78 (correct) 79 cat hello >world <universe 80 echo hello >"$world" 81 82 - We prefer $( ... ) for command substitution; unlike ``, it 83 properly nests. It should have been the way Bourne spelled 84 it from day one, but unfortunately isn't. 85 86 - If you want to find out if a command is available on the user's 87 $PATH, you should use 'type <command>', instead of 'which <command>'. 88 The output of 'which' is not machine parsable and its exit code 89 is not reliable across platforms. 90 91 - We use POSIX compliant parameter substitutions and avoid bashisms; 92 namely: 93 94 - We use ${parameter-word} and its [-=?+] siblings, and their 95 colon'ed "unset or null" form. 96 97 - We use ${parameter#word} and its [#%] siblings, and their 98 doubled "longest matching" form. 99 100 - No "Substring Expansion" ${parameter:offset:length}. 101 102 - No shell arrays. 103 104 - No pattern replacement ${parameter/pattern/string}. 105 106 - We use Arithmetic Expansion $(( ... )). 107 108 - We do not use Process Substitution <(list) or >(list). 109 110 - Do not write control structures on a single line with semicolon. 111 "then" should be on the next line for if statements, and "do" 112 should be on the next line for "while" and "for". 113 114 (incorrect) 115 if test -f hello; then 116 do this 117 fi 118 119 (correct) 120 if test -f hello 121 then 122 do this 123 fi 124 125 - If a command sequence joined with && or || or | spans multiple 126 lines, put each command on a separate line and put && and || and | 127 operators at the end of each line, rather than the start. This 128 means you don't need to use \ to join lines, since the above 129 operators imply the sequence isn't finished. 130 131 (incorrect) 132 grep blob verify_pack_result \ 133 | awk -f print_1.awk \ 134 | sort >actual && 135 ... 136 137 (correct) 138 grep blob verify_pack_result | 139 awk -f print_1.awk | 140 sort >actual && 141 ... 142 143 - We prefer "test" over "[ ... ]". 144 145 - We do not write the noiseword "function" in front of shell 146 functions. 147 148 - We prefer a space between the function name and the parentheses, 149 and no space inside the parentheses. The opening "{" should also 150 be on the same line. 151 152 (incorrect) 153 my_function(){ 154 ... 155 156 (correct) 157 my_function () { 158 ... 159 160 - As to use of grep, stick to a subset of BRE (namely, no \{m,n\}, 161 [::], [==], or [..]) for portability. 162 163 - We do not use \{m,n\}; 164 165 - We do not use ? or + (which are \{0,1\} and \{1,\} 166 respectively in BRE) but that goes without saying as these 167 are ERE elements not BRE (note that \? and \+ are not even part 168 of BRE -- making them accessible from BRE is a GNU extension). 169 170 - Use Git's gettext wrappers in git-sh-i18n to make the user 171 interface translatable. See "Marking strings for translation" in 172 po/README. 173 174 - We do not write our "test" command with "-a" and "-o" and use "&&" 175 or "||" to concatenate multiple "test" commands instead, because 176 the use of "-a/-o" is often error-prone. E.g. 177 178 test -n "$x" -a "$a" = "$b" 179 180 is buggy and breaks when $x is "=", but 181 182 test -n "$x" && test "$a" = "$b" 183 184 does not have such a problem. 185 186 - Even though "local" is not part of POSIX, we make heavy use of it 187 in our test suite. We do not use it in scripted Porcelains, and 188 hopefully nobody starts using "local" before all shells that matter 189 support it (notably, ksh from AT&T Research does not support it yet). 190 191 - Some versions of shell do not understand "export variable=value", 192 so we write "variable=value" and then "export variable" on two 193 separate lines. 194 195 - Some versions of dash have broken variable assignment when prefixed 196 with "local", "export", and "readonly", in that the value to be 197 assigned goes through field splitting at $IFS unless quoted. 198 199 (incorrect) 200 local variable=$value 201 local variable=$(command args) 202 203 (correct) 204 local variable="$value" 205 local variable="$(command args)" 206 207 - The common construct 208 209 VAR=VAL command args 210 211 to temporarily set and export environment variable VAR only while 212 "command args" is running is handy, but this triggers an 213 unspecified behaviour according to POSIX when used for a command 214 that is not an external command (like shell functions). Indeed, 215 dash 0.5.10.2-6 on Ubuntu 20.04, /bin/sh on FreeBSD 13, and AT&T 216 ksh all make a temporary assignment without exporting the variable, 217 in such a case. As it does not work portably across shells, do not 218 use this syntax for shell functions. A common workaround is to do 219 an explicit export in a subshell, like so: 220 221 (incorrect) 222 VAR=VAL func args 223 224 (correct) 225 ( 226 VAR=VAL && 227 export VAR && 228 func args 229 ) 230 231 but be careful that the effect "func" makes to the variables in the 232 current shell will be lost across the subshell boundary. 233 234 - Use octal escape sequences (e.g. "\302\242"), not hexadecimal (e.g. 235 "\xc2\xa2") in printf format strings, since hexadecimal escape 236 sequences are not portable. 237 238 239For C programs: 240 241 - We use tabs to indent, and interpret tabs as taking up to 242 8 spaces. 243 244 - Nested C preprocessor directives are indented after the hash by one 245 space per nesting level. 246 247 #if FOO 248 # include <foo.h> 249 # if BAR 250 # include <bar.h> 251 # endif 252 #endif 253 254 - We try to keep to at most 80 characters per line. 255 256 - As a Git developer we assume you have a reasonably modern compiler 257 and we recommend you to enable the DEVELOPER makefile knob to 258 ensure your patch is clear of all compiler warnings we care about, 259 by e.g. "echo DEVELOPER=1 >>config.mak". 260 261 - When using DEVELOPER=1 mode, you may see warnings from the compiler 262 like "error: unused parameter 'foo' [-Werror=unused-parameter]", 263 which indicates that a function ignores its argument. If the unused 264 parameter can't be removed (e.g., because the function is used as a 265 callback and has to match a certain interface), you can annotate 266 the individual parameters with the UNUSED (or MAYBE_UNUSED) 267 keyword, like "int foo UNUSED". 268 269 - We try to support a wide range of C compilers to compile Git with, 270 including old ones. As of Git v2.35.0 Git requires C99 (we check 271 "__STDC_VERSION__"). You should not use features from a newer C 272 standard, even if your compiler groks them. 273 274 New C99 features have been phased in gradually, if something's new 275 in C99 but not used yet don't assume that it's safe to use, some 276 compilers we target have only partial support for it. These are 277 considered safe to use: 278 279 . since around 2007 with 2b6854c863a, we have been using 280 initializer elements which are not computable at load time. E.g.: 281 282 const char *args[] = { "constant", variable, NULL }; 283 284 . since early 2012 with e1327023ea, we have been using an enum 285 definition whose last element is followed by a comma. This, like 286 an array initializer that ends with a trailing comma, can be used 287 to reduce the patch noise when adding a new identifier at the end. 288 289 . since mid 2017 with cbc0f81d, we have been using designated 290 initializers for struct (e.g. "struct t v = { .val = 'a' };"). 291 292 . since mid 2017 with 512f41cf, we have been using designated 293 initializers for array (e.g. "int array[10] = { [5] = 2 }"). 294 295 . since early 2021 with 765dc168882, we have been using variadic 296 macros, mostly for printf-like trace and debug macros. 297 298 . since late 2021 with 44ba10d6, we have had variables declared in 299 the for loop "for (int i = 0; i < 10; i++)". 300 301 . since late 2023 with 8277dbe987 we have been using the bool type 302 from <stdbool.h>. 303 304 C99 features we have test balloons for: 305 306 . since late 2024 with v2.48.0-rc0~20, we have test balloons for 307 compound literal syntax, e.g., (struct foo){ .member = value }; 308 our hope is that no platforms we care about have trouble using 309 them, and officially adopt its wider use in mid 2026. Do not add 310 more use of the syntax until that happens. 311 312 New C99 features that we cannot use yet: 313 314 . %z and %zu as a printf() argument for a size_t (the %z being for 315 the POSIX-specific ssize_t). Instead you should use 316 printf("%"PRIuMAX, (uintmax_t)v). These days the MSVC version we 317 rely on supports %z, but the C library used by MinGW does not. 318 319 . Shorthand like ".a.b = *c" in struct initializations is known to 320 trip up an older IBM XLC version, use ".a = { .b = *c }" instead. 321 See the 33665d98 (reftable: make assignments portable to AIX xlc 322 v12.01, 2022-03-28). 323 324 - Variables have to be declared at the beginning of the block, before 325 the first statement (i.e. -Wdeclaration-after-statement). It is 326 encouraged to have a blank line between the end of the declarations 327 and the first statement in the block. 328 329 - Do not explicitly initialize global variables to 0 or NULL; 330 instead, let BSS take care of the zero initialization. 331 332 - NULL pointers shall be written as NULL, not as 0. 333 334 - When declaring pointers, the star sides with the variable 335 name, i.e. "char *string", not "char* string" or 336 "char * string". This makes it easier to understand code 337 like "char *string, c;". 338 339 - Use whitespace around operators and keywords, but not inside 340 parentheses and not around functions. So: 341 342 while (condition) 343 func(bar + 1); 344 345 and not: 346 347 while( condition ) 348 func (bar+1); 349 350 - A binary operator (other than ",") and ternary conditional "?:" 351 have a space on each side of the operator to separate it from its 352 operands. E.g. "A + 1", not "A+1". 353 354 - A unary operator (other than "." and "->") have no space between it 355 and its operand. E.g. "(char *)ptr", not "(char *) ptr". 356 357 - Do not explicitly compare an integral value with constant 0 or '\0', 358 or a pointer value with constant NULL. For instance, to validate that 359 counted array <ptr, cnt> is initialized but has no elements, write: 360 361 if (!ptr || cnt) 362 BUG("empty array expected"); 363 364 and not: 365 366 if (ptr == NULL || cnt != 0); 367 BUG("empty array expected"); 368 369 - We avoid using braces unnecessarily. I.e. 370 371 if (bla) { 372 x = 1; 373 } 374 375 is frowned upon. But there are a few exceptions: 376 377 - When the statement extends over a few lines (e.g., a while loop 378 with an embedded conditional, or a comment). E.g.: 379 380 while (foo) { 381 if (x) 382 one(); 383 else 384 two(); 385 } 386 387 if (foo) { 388 /* 389 * This one requires some explanation, 390 * so we're better off with braces to make 391 * it obvious that the indentation is correct. 392 */ 393 doit(); 394 } 395 396 - When there are multiple arms to a conditional and some of them 397 require braces, enclose even a single line block in braces for 398 consistency. E.g.: 399 400 if (foo) { 401 doit(); 402 } else { 403 one(); 404 two(); 405 three(); 406 } 407 408 - We try to avoid assignments in the condition of an "if" statement. 409 410 - Try to make your code understandable. You may put comments 411 in, but comments invariably tend to stale out when the code 412 they were describing changes. Often splitting a function 413 into two makes the intention of the code much clearer. 414 415 - Multi-line comments include their delimiters on separate lines from 416 the text. E.g. 417 418 /* 419 * A very long 420 * multi-line comment. 421 */ 422 423 Note however that a comment that explains a translatable string to 424 translators uses a convention of starting with a magic token 425 "TRANSLATORS: ", e.g. 426 427 /* 428 * TRANSLATORS: here is a comment that explains the string to 429 * be translated, that follows immediately after it. 430 */ 431 _("Here is a translatable string explained by the above."); 432 433 - Double negation is often harder to understand than no negation 434 at all. 435 436 - There are two schools of thought when it comes to comparison, 437 especially inside a loop. Some people prefer to have the less stable 438 value on the left hand side and the more stable value on the right hand 439 side, e.g. if you have a loop that counts variable i down to the 440 lower bound, 441 442 while (i > lower_bound) { 443 do something; 444 i--; 445 } 446 447 Other people prefer to have the textual order of values match the 448 actual order of values in their comparison, so that they can 449 mentally draw a number line from left to right and place these 450 values in order, i.e. 451 452 while (lower_bound < i) { 453 do something; 454 i--; 455 } 456 457 Both are valid, and we use both. However, the more "stable" the 458 stable side becomes, the more we tend to prefer the former 459 (comparison with a constant, "i > 0", is an extreme example). 460 Just do not mix styles in the same part of the code and mimic 461 existing styles in the neighbourhood. 462 463 - There are two schools of thought when it comes to splitting a long 464 logical line into multiple lines. Some people push the second and 465 subsequent lines far enough to the right with tabs and align them: 466 467 if (the_beginning_of_a_very_long_expression_that_has_to || 468 span_more_than_a_single_line_of || 469 the_source_text) { 470 ... 471 472 while other people prefer to align the second and the subsequent 473 lines with the column immediately inside the opening parenthesis, 474 with tabs and spaces, following our "tabstop is always a multiple 475 of 8" convention: 476 477 if (the_beginning_of_a_very_long_expression_that_has_to || 478 span_more_than_a_single_line_of || 479 the_source_text) { 480 ... 481 482 Both are valid, and we use both. Again, just do not mix styles in 483 the same part of the code and mimic existing styles in the 484 neighbourhood. 485 486 - When splitting a long logical line, some people change line before 487 a binary operator, so that the result looks like a parse tree when 488 you turn your head 90-degrees counterclockwise: 489 490 if (the_beginning_of_a_very_long_expression_that_has_to 491 || span_more_than_a_single_line_of_the_source_text) { 492 493 while other people prefer to leave the operator at the end of the 494 line: 495 496 if (the_beginning_of_a_very_long_expression_that_has_to || 497 span_more_than_a_single_line_of_the_source_text) { 498 499 Both are valid, but we tend to use the latter more, unless the 500 expression gets fairly complex, in which case the former tends to 501 be easier to read. Again, just do not mix styles in the same part 502 of the code and mimic existing styles in the neighbourhood. 503 504 - When splitting a long logical line, with everything else being 505 equal, it is preferable to split after the operator at higher 506 level in the parse tree. That is, this is more preferable: 507 508 if (a_very_long_variable * that_is_used_in + 509 a_very_long_expression) { 510 ... 511 512 than 513 514 if (a_very_long_variable * 515 that_is_used_in + a_very_long_expression) { 516 ... 517 518 - Some clever tricks, like using the !! operator with arithmetic 519 constructs, can be extremely confusing to others. Avoid them, 520 unless there is a compelling reason to use them. 521 522 - Use the API. No, really. We have a strbuf (variable length 523 string), several arrays with the ALLOC_GROW() macro, a 524 string_list for sorted string lists, a hash map (mapping struct 525 objects) named "struct decorate", amongst other things. 526 527 - When you come up with an API, document its functions and structures 528 in the header file that exposes the API to its callers. Use what is 529 in "strbuf.h" as a model for the appropriate tone and level of 530 detail. 531 532 - The first #include in C files, except in platform specific compat/ 533 implementations and sha1dc/, must be <git-compat-util.h>. This 534 header file insulates other header files and source files from 535 platform differences, like which system header files must be 536 included in what order, and what C preprocessor feature macros must 537 be defined to trigger certain features we expect out of the system. 538 A collorary to this is that C files should not directly include 539 system header files themselves. 540 541 There are some exceptions, because certain group of files that 542 implement an API all have to include the same header file that 543 defines the API and it is convenient to include <git-compat-util.h> 544 there. Namely: 545 546 - the implementation of the built-in commands in the "builtin/" 547 directory that include "builtin.h" for the cmd_foo() prototype 548 definition, 549 550 - the test helper programs in the "t/helper/" directory that include 551 "t/helper/test-tool.h" for the cmd__foo() prototype definition, 552 553 - the xdiff implementation in the "xdiff/" directory that includes 554 "xdiff/xinclude.h" for the xdiff machinery internals, 555 556 - the unit test programs in "t/unit-tests/" directory that include 557 "t/unit-tests/test-lib.h" that gives them the unit-tests 558 framework, and 559 560 - the source files that implement reftable in the "reftable/" 561 directory that include "reftable/system.h" for the reftable 562 internals, 563 564 are allowed to assume that they do not have to include 565 <git-compat-util.h> themselves, as it is included as the first 566 '#include' in these header files. These headers must be the first 567 header file to be "#include"d in them, though. 568 569 - A C file must directly include the header files that declare the 570 functions and the types it uses, except for the functions and types 571 that are made available to it by including one of the header files 572 it must include by the previous rule. 573 574 - If you are planning a new command, consider writing it in shell 575 or perl first, so that changes in semantics can be easily 576 changed and discussed. Many Git commands started out like 577 that, and a few are still scripts. 578 579 - Avoid introducing a new dependency into Git. This means you 580 usually should stay away from scripting languages not already 581 used in the Git core command set (unless your command is clearly 582 separate from it, such as an importer to convert random-scm-X 583 repositories to Git). 584 585 - When we pass <string, length> pair to functions, we should try to 586 pass them in that order. 587 588 - Use Git's gettext wrappers to make the user interface 589 translatable. See "Marking strings for translation" in po/README. 590 591 - Variables and functions local to a given source file should be marked 592 with "static". Variables that are visible to other source files 593 must be declared with "extern" in header files. However, function 594 declarations should not use "extern", as that is already the default. 595 596 - You can launch gdb around your program using the shorthand GIT_DEBUGGER. 597 Run `GIT_DEBUGGER=1 ./bin-wrappers/git foo` to simply use gdb as is, or 598 run `GIT_DEBUGGER="<debugger> <debugger-args>" ./bin-wrappers/git foo` to 599 use your own debugger and arguments. Example: `GIT_DEBUGGER="ddd --gdb" 600 ./bin-wrappers/git log` (See `bin-wrappers/wrap-for-bin.sh`.) 601 602 - The primary data structure that a subsystem 'S' deals with is called 603 `struct S`. Functions that operate on `struct S` are named 604 `S_<verb>()` and should generally receive a pointer to `struct S` as 605 first parameter. E.g. 606 607 struct strbuf; 608 609 void strbuf_add(struct strbuf *buf, ...); 610 611 void strbuf_reset(struct strbuf *buf); 612 613 is preferred over: 614 615 struct strbuf; 616 617 void add_string(struct strbuf *buf, ...); 618 619 void reset_strbuf(struct strbuf *buf); 620 621 - There are several common idiomatic names for functions performing 622 specific tasks on a structure `S`: 623 624 - `S_init()` initializes a structure without allocating the 625 structure itself. 626 627 - `S_release()` releases a structure's contents without reinitializing 628 the structure for immediate reuse, and without freeing the structure 629 itself. 630 631 - `S_clear()` is equivalent to `S_release()` followed by `S_init()` 632 such that the structure is directly usable after clearing it. When 633 `S_clear()` is provided, `S_init()` shall not allocate resources 634 that need to be released again. 635 636 - `S_free()` releases a structure's contents and frees the 637 structure. 638 639 - Function names should be clear and descriptive, accurately reflecting 640 their purpose or behavior. Arbitrary suffixes that do not add meaningful 641 context can lead to confusion, particularly for newcomers to the codebase. 642 643 Historically, the '_1' suffix has been used in situations where: 644 645 - A function handles one element among a group that requires similar 646 processing. 647 - A recursive function has been separated from its setup phase. 648 649 The '_1' suffix can be used as a concise way to indicate these specific 650 cases. However, it is recommended to find a more descriptive name wherever 651 possible to improve the readability and maintainability of the code. 652 653 - Bit fields should be defined without a space around the colon. E.g. 654 655 unsigned my_field:1; 656 unsigned other_field:1; 657 unsigned field_with_longer_name:1; 658 659For Perl programs: 660 661 - Most of the C guidelines above apply. 662 663 - We try to support Perl 5.8.1 and later ("use Perl 5.008001"). 664 665 - use strict and use warnings are strongly preferred. 666 667 - Don't overuse statement modifiers unless using them makes the 668 result easier to follow. 669 670 ... do something ... 671 do_this() unless (condition); 672 ... do something else ... 673 674 is more readable than: 675 676 ... do something ... 677 unless (condition) { 678 do_this(); 679 } 680 ... do something else ... 681 682 *only* when the condition is so rare that do_this() will be almost 683 always called. 684 685 - We try to avoid assignments inside "if ()" conditions. 686 687 - Learn and use Git.pm if you need that functionality. 688 689For Python scripts: 690 691 - We follow PEP-8 (https://peps.python.org/pep-0008/). 692 693 - As a minimum, we aim to be compatible with Python 2.7. 694 695 - Where required libraries do not restrict us to Python 2, we try to 696 also be compatible with Python 3.1 and later. 697 698 699Program Output 700 701 We make a distinction between a Git command's primary output and 702 output which is merely chatty feedback (for instance, status 703 messages, running transcript, or progress display), as well as error 704 messages. Roughly speaking, a Git command's primary output is that 705 which one might want to capture to a file or send down a pipe; its 706 chatty output should not interfere with these use-cases. 707 708 As such, primary output should be sent to the standard output stream 709 (stdout), and chatty output should be sent to the standard error 710 stream (stderr). Examples of commands which produce primary output 711 include `git log`, `git show`, and `git branch --list` which generate 712 output on the stdout stream. 713 714 Not all Git commands have primary output; this is often true of 715 commands whose main function is to perform an action. Some action 716 commands are silent, whereas others are chatty. An example of a 717 chatty action commands is `git clone` with its "Cloning into 718 '<path>'..." and "Checking connectivity..." status messages which it 719 sends to the stderr stream. 720 721 Error messages from Git commands should always be sent to the stderr 722 stream. 723 724 725Error Messages 726 727 - Do not end a single-sentence error message with a full stop. 728 729 - Do not capitalize the first word, only because it is the first word 730 in the message ("unable to open '%s'", not "Unable to open '%s'"). But 731 "SHA-3 not supported" is fine, because the reason the first word is 732 capitalized is not because it is at the beginning of the sentence, 733 but because the word would be spelled in capital letters even when 734 it appeared in the middle of the sentence. 735 736 - Say what the error is first ("cannot open '%s'", not "%s: cannot open"). 737 738 - Enclose the subject of an error inside a pair of single quotes, 739 e.g. `die(_("unable to open '%s'"), path)`. 740 741 - Unless there is a compelling reason not to, error messages from 742 porcelain commands should be marked for translation, e.g. 743 `die(_("bad revision %s"), revision)`. 744 745 - Error messages from the plumbing commands are sometimes meant for 746 machine consumption and should not be marked for translation, 747 e.g., `die("bad revision %s", revision)`. 748 749 - BUG("message") are for communicating the specific error to developers, 750 thus should not be translated. 751 752 753Externally Visible Names 754 755 - For configuration variable names, follow the existing convention: 756 757 . The section name indicates the affected subsystem. 758 759 . The subsection name, if any, indicates which of an unbounded set 760 of things to set the value for. 761 762 . The variable name describes the effect of tweaking this knob. 763 764 The section and variable names that consist of multiple words are 765 formed by concatenating the words without punctuation marks (e.g. `-`), 766 and are broken using bumpyCaps in documentation as a hint to the 767 reader. 768 769 When choosing the variable namespace, do not use variable name for 770 specifying possibly unbounded set of things, most notably anything 771 an end user can freely come up with (e.g. branch names). Instead, 772 use subsection names or variable values, like the existing variable 773 branch.<name>.description does. 774 775 776Writing Documentation: 777 778 Most (if not all) of the documentation pages are written in the 779 AsciiDoc format in *.adoc files (e.g. Documentation/git.adoc), and 780 processed into HTML and manpages (e.g. git.html and git.1 in the 781 same directory). 782 783 The documentation liberally mixes US and UK English (en_US/UK) 784 norms for spelling and grammar, which is somewhat unfortunate. 785 In an ideal world, it would have been better if it consistently 786 used only one and not the other, and we would have picked en_US 787 (if you wish to correct the English of some of the existing 788 documentation, please see the documentation-related advice in the 789 Documentation/SubmittingPatches file). 790 791 In order to ensure the documentation is inclusive, avoid assuming 792 that an unspecified example person is male or female, and think 793 twice before using "he", "him", "she", or "her". Here are some 794 tips to avoid use of gendered pronouns: 795 796 - Prefer succinctness and matter-of-factly describing functionality 797 in the abstract. E.g. 798 799 `--short`:: Emit output in the short-format. 800 801 and avoid something like these overly verbose alternatives: 802 803 `--short`:: Use this to emit output in the short-format. 804 `--short`:: You can use this to get output in the short-format. 805 `--short`:: A user who prefers shorter output could.... 806 `--short`:: Should a person and/or program want shorter output, he 807 she/they/it can... 808 809 This practice often eliminates the need to involve human actors in 810 your description, but it is a good practice regardless of the 811 avoidance of gendered pronouns. 812 813 - When it becomes awkward to stick to this style, prefer "you" when 814 addressing the hypothetical user, and possibly "we" when 815 discussing how the program might react to the user. E.g. 816 817 You can use this option instead of `--xyz`, but we might remove 818 support for it in future versions. 819 820 while keeping in mind that you can probably be less verbose, e.g. 821 822 Use this instead of `--xyz`. This option might be removed in future 823 versions. 824 825 - If you still need to refer to an example person that is 826 third-person singular, you may resort to "singular they" to avoid 827 "he/she/him/her", e.g. 828 829 A contributor asks their upstream to pull from them. 830 831 Note that this sounds ungrammatical and unnatural to those who 832 learned that "they" is only used for third-person plural, e.g. 833 those who learn English as a second language in some parts of the 834 world. 835 836 Every user-visible change should be reflected in the documentation. 837 The same general rule as for code applies -- imitate the existing 838 conventions. 839 840 841Markup: 842 843 Literal parts (e.g. use of command-line options, command names, 844 branch names, URLs, pathnames (files and directories), configuration and 845 environment variables) must be typeset as verbatim (i.e. wrapped with 846 backticks): 847 `--pretty=oneline` 848 `git rev-list` 849 `remote.pushDefault` 850 `http://git.example.com` 851 `.git/config` 852 `GIT_DIR` 853 `HEAD` 854 `umask`(2) 855 856 An environment variable must be prefixed with "$" only when referring to its 857 value and not when referring to the variable itself, in this case there is 858 nothing to add except the backticks: 859 `GIT_DIR` is specified 860 `$GIT_DIR/hooks/pre-receive` 861 862 Word phrases enclosed in `backtick characters` are rendered literally 863 and will not be further expanded. The use of `backticks` to achieve the 864 previous rule means that literal examples should not use AsciiDoc 865 escapes. 866 Correct: 867 `--pretty=oneline` 868 Incorrect: 869 `\--pretty=oneline` 870 871 Placeholders are spelled in lowercase and enclosed in 872 angle brackets surrounded by underscores: 873 _<file>_ 874 _<commit>_ 875 876 If a placeholder has multiple words, they are separated by dashes: 877 _<new-branch-name>_ 878 _<template-directory>_ 879 880 When needed, use a distinctive identifier for placeholders, usually 881 made of a qualification and a type: 882 _<git-dir>_ 883 _<key-id>_ 884 885Characters are also surrounded by underscores: 886 _LF_, _CR_, _CR_/_LF_, _NUL_, _EOF_ 887 888 Git's Asciidoc processor has been tailored to treat backticked text 889 as complex synopsis. When literal and placeholders are mixed, you can 890 use the backtick notation which will take care of correctly typesetting 891 the content. 892 `--jobs <n>` 893 `--sort=<key>` 894 `<directory>/.git` 895 `remote.<name>.mirror` 896 `ssh://[<user>@]<host>[:<port>]/<path-to-git-repo>` 897 898As a side effect, backquoted placeholders are correctly typeset, but 899this style is not recommended. 900 901 When documenting multiple related `git config` variables, place them on 902 a separate line instead of separating them by commas. For example, do 903 not write this: 904 `core.var1`, `core.var2`:: 905 Description common to `core.var1` and `core.var2`. 906 907Instead write this: 908 `core.var1`:: 909 `core.var2`:: 910 Description common to `core.var1` and `core.var2`. 911 912Synopsis Syntax 913 914 The synopsis (a paragraph with [synopsis] attribute) is automatically 915 formatted by the toolchain and does not need typesetting. 916 917 A few commented examples follow to provide reference when writing or 918 modifying command usage strings and synopsis sections in the manual 919 pages: 920 921 Possibility of multiple occurrences is indicated by three dots: 922 <file>... 923 (One or more of <file>.) 924 925 Optional parts are enclosed in square brackets: 926 [<file>...] 927 (Zero or more of <file>.) 928 929 An optional parameter needs to be typeset with unconstrained pairs 930 [<repository>] 931 932 --exec-path[=<path>] 933 (Option with an optional argument. Note that the "=" is inside the 934 brackets.) 935 936 [<patch>...] 937 (Zero or more of <patch>. Note that the dots are inside, not 938 outside the brackets.) 939 940 Multiple alternatives are indicated with vertical bars: 941 [-q | --quiet] 942 [--utf8 | --no-utf8] 943 944 Use spacing around "|" token(s), but not immediately after opening or 945 before closing a [] or () pair: 946 Do: [-q | --quiet] 947 Don't: [-q|--quiet] 948 949 Don't use spacing around "|" tokens when they're used to separate the 950 alternate arguments of an option: 951 Do: --track[=(direct|inherit)] 952 Don't: --track[=(direct | inherit)] 953 954 Parentheses are used for grouping: 955 [(<rev>|<range>)...] 956 (Any number of either <rev> or <range>. Parens are needed to make 957 it clear that "..." pertains to both <rev> and <range>.) 958 959 [(-p <parent>)...] 960 (Any number of option -p, each with one <parent> argument.) 961 962 git remote set-head <name> (-a|-d|<branch>) 963 (One and only one of "-a", "-d" or "<branch>" _must_ (no square 964 brackets) be provided.) 965 966 And a somewhat more contrived example: 967 --diff-filter=[(A|C|D|M|R|T|U|X|B)...[*]] 968 Here "=" is outside the brackets, because "--diff-filter=" is a 969 valid usage. "*" has its own pair of brackets, because it can 970 (optionally) be specified only when one or more of the letters is 971 also provided. 972 973 A note on notation: 974 Use 'git' (all lowercase) when talking about commands i.e. something 975 the user would type into a shell and use 'Git' (uppercase first letter) 976 when talking about the version control system and its properties. 977 978 If some place in the documentation needs to typeset a command usage 979 example with inline substitutions, it is fine to use +monospaced and 980 inline substituted text+ instead of `monospaced literal text`, and with 981 the former, the part that should not get substituted must be 982 quoted/escaped.