1 Introduction 2 Ground Rules Building a File System 3 File Systems 4 File Content Data Structure 5 Allocation Cluster Manager 6 Exceptions and Emancipation 7 Base Classes, Testing, and More 8 File Meta Data 9 Native File Class 10 Our File System 11 Allocation Table 12 File System Support Code 13 Initializing the File System 14 Contiguous Files 15 Rebuilding the File System 16 Native File System Support Methods 17 Lookups, Wildcards, and Unicode, Oh My 18 Finishing the File System Class The Init Program 19 Hardware Abstraction and UOS Architecture 20 Init Command Mode 21 Using Our File System 22 Hardware and Device Lists 23 Fun with Stores: Partitions 24 Fun with Stores: RAID 25 Fun with Stores: RAM Disks 26 Init wrap-up The Executive 27 Overview of The Executive 28 Starting the Kernel 29 The Kernel 30 Making a Store Bootable 31 The MMC 32 The HMC 33 Loading the components 34 Using the File Processor 35 Symbols and the SSC 36 The File Processor and Device Management 37 The File Processor and File System Management 38 Finishing Executive Startup Users and Security 39 Introduction to Users and Security 40 More Fun With Stores: File Heaps 41 File Heaps, part 2 42 SysUAF 43 TUser 44 SysUAF API Terminal I/O 45 Shells and UCL 46 UOS API, the Application Side 47 UOS API, the Executive Side 48 I/O Devices 49 Streams 50 Terminal Output Filters 51 The TTerminal Class 52 Handles 53 Putting it All Together 54 Getting Terminal Input 55 QIO 56 Cooking Terminal Input 57 Putting it all together, part 2 58 Quotas and I/O UCL 59 UCL Basics 60 Symbol Substitution 61 Command execution 62 Command execution, part 2 63 Command Abbreviation 64 ASTs 65 Expressions, Part 1 66 Expressions, Part 2: Support code 67 Expressions, part 3: Parsing 68 SYS_GETJPIW and SYS_TRNLNM 69 Expressions, part 4: Evaluation UCL Lexical Functions 70 PROCESS_SCAN 71 PROCESS_SCAN, Part 2 72 TProcess updates 73 Unicode revisted 74 Lexical functions: F$CONTEXT 75 Lexical functions: F$PID 76 Lexical Functions: F$CUNITS 77 Lexical Functions: F$CVSI and F$CVUI 78 UOS Date and Time Formatting 79 Lexical Functions: F$CVTIME 80 LIB_CVTIME 81 Date/Time Contexts 82 SYS_GETTIM, LIB_Get_Timestamp, SYS_ASCTIM, and LIB_SYS_ASCTIM 83 Lexical Functions: F$DELTA_TIME 84 Lexical functions: F$DEVICE 85 SYS_DEVICE_SCAN 86 Lexical functions: F$DIRECTORY 87 Lexical functions: F$EDIT and F$ELEMENT 88 Lexical functions: F$ENVIRONMENT 89 SYS_GETUAI 90 Lexical functions: F$EXTRACT and F$IDENTIFIER 91 LIB_FAO and LIB_FAOL 92 LIB_FAO and LIB_FAOL, part 2 93 Lexical functions: F$FAO 94 File Processing Structures 95 Lexical functions: F$FILE_ATTRIBUTES 96 SYS_DISPLAY 97 Lexical functions: F$GETDVI 98 Parse_GetDVI 99 GetDVI 100 GetDVI, part 2 101 GetDVI, part 3 102 Lexical functions: F$GETJPI 103 GETJPI 104 Lexical functions: F$GETSYI 105 GETSYI 106 Lexical functions: F$INTEGER, F$LENGTH, F$LOCATE, and F$MATCH_WILD 107 Lexical function: F$PARSE 108 FILESCAN 109 SYS_PARSE 110 Lexical Functions: F$MODE, F$PRIVILEGE, and F$PROCESS 111 File Lookup Service 112 Lexical Functions: F$SEARCH 113 SYS_SEARCH 114 F$SETPRV and SYS_SETPRV 115 Lexical Functions: F$STRING, F$TIME, and F$TYPE 116 More on symbols 117 Lexical Functions: F$TRNLNM 118 SYS_TRNLNM, Part 2 119 Lexical functions: F$UNIQUE, F$USER, and F$VERIFY 120 Lexical functions: F$MESSAGE 121 TUOS_File_Wrapper 122 OPEN, CLOSE, and READ system services UCL Commands 123 WRITE 124 Symbol assignment 125 The @ command 126 @ and EXIT 127 CRELNT system service 128 DELLNT system service 129 IF...THEN...ELSE 130 Comments, labels, and GOTO 131 GOSUB and RETURN 132 CALL, SUBROUTINE, and ENDSUBROUTINE 133 ON, SET {NO}ON, and error handling 134 INQUIRE 135 SYS_WRITE Service 136 OPEN 137 CLOSE 138 DELLNM system service 139 READ 140 Command Recall 141 RECALL 142 RUN 143 LIB_RUN 144 The Data Stream Interface 145 Preparing for execution 146 EOJ and LOGOUT 147 SYS_DELPROC and LIB_GET_FOREIGN CUSPs and utilities 148 The I/O Queue 149 Timers 150 Logging in, part one 151 Logging in, part 2 152 System configuration 153 SET NODE utility 154 UUI 155 SETTERM utility 156 SETTERM utility, part 2 157 SETTERM utility, part 3 158 AUTHORIZE utility 159 AUTHORIZE utility, UI 160 AUTHORIZE utility, Access Restrictions 161 AUTHORIZE utility, Part 4 162 AUTHORIZE utility, Reporting 163 AUTHORIZE utility, Part 6 164 Authentication 165 Hashlib 166 Authenticate, Part 7 167 Logging in, part 3 168 DAY_OF_WEEK, CVT_FROM_INTERNAL_TIME, and SPAWN 169 DAY_OF_WEEK and CVT_FROM_INTERNAL_TIME 170 LIB_SPAWN 171 CREPRC 172 CREPRC, Part 2 173 COPY 174 COPY, part 2 175 COPY, part 3 176 COPY, part 4 177 LIB_Get_Default_File_Protection and LIB_Substitute_Wildcards 178 CREATESTREAM, STREAMNAME, and Set_Contiguous 179 Help Files 180 LBR Services 181 LBR Services, Part 2 182 LIBRARY utility 183 LIBRARY utility, Part 2 184 FS Services 185 FS Services, Part 2 186 Implementing Help 187 HELP 188 HELP, Part 2 189 DMG_Get_Key and LIB_Put_Formatted_Output 190 LIBRARY utility, Part 3 191 Shutting Down UOS 192 SHUTDOWN 193 WAIT 194 SETIMR 195 WAITFR and Scheduling 196 REPLY, OPCOM, and Mailboxes 197 REPLY utility 198 Mailboxes 199 BRKTHRU 200 OPCOM 201 Mailbox Services 202 Mailboxes, Part 2 203 DEFINE 204 CRELNM 205 DISABLE 206 STOP 207 OPCCRASH and SHUTDOWN 208 APPEND Glossary/Index Downloads |
Initializing the File System
Note: I found two bugs in the TUOS_Native_File code. First, it should have been updating the EOF field of the header to match the highest address written to. It was a simple change in the _Write method and also the Set_Size method (when the file is being truncated, so that EOF isn't past the physical end of the file). Second, it was calculating the cluster incorrectly in Set_Size for boundary conditions. In this article, we start upon writing our file system class. First, the standard methods we see in all of our classes:
There is also a getter and setter for the heap. Since the file system component code cannot call the Microsoft Windows API to allocate dynamic memory, we need to ask UOS for a heap manager. However, our file system will also be used during system initialization, prior to UOS actually being loaded and available. In such case, we need the initialization code to be able to provide the component with a heap manager. Because hooking into the heap management this way will affect everything compiled with our file system class (which is all the classes that it uses), we don't need to bother with setting the heap for any other objects. This one place handles it all.
Delphi provides a simple way to intercept heap calls via the SetMemoryManager. Here is our implementation of NewMemMgr:
Before we can use a file system that is on a store, we need to have a file system structure on the store. So, the first thing we will write is our initialization method. Since Initialize is already a method name (for initializing the class instance), we will simply call our method Init.
Whatever code calls this method to initialize the file system passes a string label that serves as a "name" for the store, a number of patterns to run, the minimum cluster size of the store, the cluster to use for folders, some options (flags) for the store, and the offset of the allocation table on the store. Notifier is an optional callback that is used in surface scans. The code for this method is algorithmically simple, although it has to do several tasks. We will examine each of these tasks in turn. Flags is a bit mask with the following meanings:
These flags have no meaning to the Init method except that we will set the flags field in the store header to the passed value. We will discuss their use in the future.
First, we must have a store to initialize, so we exit if no store has been set. Then we make sure that the requested clustersize is no smaller than the store's minimum allocation size. In effect, if the caller passes 0 for the clustersize, it will default to the store's inherent cluster size. Next, we make sure that the folder clustersize is no smaller than the default clustersize. Then we normalize the cluster sizes so that they are even multiples of the store's cluster size. Once the clustersizes are verified/set we set the store's minimum store size to the new cluster size (which may be the same as the current one). Now we can calculate the total number of clusters on the store, and how large our allocation table must be. If the user requested the default AT position, we will set it so that it occupies the center of the store. In the case of disks, this should minimize the amount of disk head movement that results from updating the table while UOS is running. Of course, this assumes that the disk is mostly full so that there is an equal amount of data on both sides of the table. For a disk that is not near capacity, the table may have more data before it than after it, or vice versa. In general, placing it in the center of the store gives the best overall performance for most circumstances. Giving the user the opportunity to adjust the location of the table allows them to customize disk performance based on their needs and expectations. AT_Offset is passed by reference, so that if we assign it, the caller will know what the actual position is. "Wait!" you say - we've already said that the allocation table is always updated in memory and only written out to the store when it is dismounted. So, why would the position of the table have any bearing on disk performance? Well, this is generally correct. However, there is an exception in a certain case (dual-ported disks). In the normal situation, it doesn't matter where the table is positioned, so the center of the store is as good as any other place. In the exceptionally case, the position will have an impact on disk performance. So, placing it in the center works in both cases. Finally we allocate a couple of buffers large enough to hold the folder cluster size. Since the folder cluster size will never be smaller than the default clustersize, the buffers will be large enough for whatever purpose we need later in the method. At this point, we are ready to do the work of initializing. We will include the code in a try/finally block to make sure that the buffers are released when we exit:
Patterns and bad blocks
This callback passes a pointer to the file system object, a percent complete, and a count of bad clusters. Further, a boolean variable is passed by reference so that the process can be aborted by the caller before the surface scan completes. The next question that arises is, how do we remove these bad clusters from "circulation"? One thing we could do is simply mark the cluster as used in the allocation table. However, since we only write the allocation table out when the store is specifically dismounted, if the system crashes, we have to rebuild the table. But, this means that we lose the allocated bad clusters. So, we need to store the bad clusters in a more persistent area. There are many possible approaches to this, but we will use this one: we will have a file, appropriately called BadBlocks.sys, that will contain the bad clusters. An obvious approach is to simply have the file contents be a list of the bad clusters. A slightly less obvious, but more efficient, approach is to have the file be made up of the bad clusters themselves. That is, the cluster allocation chain itself will consist entirely of the bad clusters. Thus, the only overhead of the list of bad clusters is the header for the file and the allocation chain. This has the benefit of making sure the allocation table has the bad clusters marked as allocated if we have to rebuild the table since the rebuild function will simply process all the files on the store, running through the allocation chains and updating the table as it goes along. That is, the process doesn't need to know anything about bad blocks - it just deals with files. This is also more storage-efficient than storing the clusters as file data because the same amount of space would be used to store them in file data, but with the added overhead of the cluster chain for the file data. We need to build up a list of bad clusters so that we can create BadBlocks.sys with the appropriate clusters. We have to complete the list before we create this file (or do any other file system work for that matter) since we have to avoid making use of any bad clusters as we allocate space for our on-store file system structures. Here is the list and the local routine used to maintain this list as we do our scan:
We simply reallocate the buffer to be larger, then store the passed cluster address in the buffer. Now, let's examine the scanning process.
First, if the caller requested any patterns, we make sure the store has its low-level formatting. For most hard disks, this is done before the drive is sold and the Format method will do nothing. For things such as floppy disks, this will write out low-level formatting information that the media needs before we even bother with reading/writing sectors. Note that we have 6 specific bit patterns that we have defined. Beyond that, we simply use the current pattern iteration as the bit pattern.
We fill the buffer with the pattern, then we specifically try the root cluster. If we find that the boot sector is bad, we immediately exit since that is the one cluster that absolutely must be good since it is where we start to find the beginning of the file system structure. We write the one buffer, read into the other buffer, then compare each byte between the buffers. If the store has an error, that indicates that the drive detected the bad area (usually this indicates that low-level formatting information on the media is bad as well).
The rest of each pattern loop involves stepping through the rest of the clusters and doing the same test as we did for the root cluster, except that an error doesn't abort the process. Instead, we add the cluster to our list of bad blocks. Also, at the start of the cluster loop, we check to see if we are another 1% of the way through the whole scan process. If so, we do the callback and abort if told to. Once we have scanned the disk and built our list of bad blocks (if any), the next step is to make sure our initial allocation table represents these clusters as unavailable (allocated). Here is the code:
Creating the base file system Next, we need to mark the allocation table, itself, as allocated. We do this by telling the store to allocate the appropriate number of clusters at the determined offset. However, it is possible that some of the requested clusters are bad and thus already marked as allocated. So, if our request fails, we try to move the allocation table to a higher cluster offset. We repeat this process until we succeed or reach the final cluster. In this case, we then try the move the table to a lower cluster offset - we start at the first cluster and move up until we can find a place to put it. If we can't find a place to put the table, we exit with an error. Note that if we cannot find enough contiguous clusters for the allocation table, it probably indicates some media so riddled with errors that we probably don't want to use it anyway. Since AT_Offset is passed by reference, the caller will know exactly where the table ended up.
Now we are ready to create the actual file system structure on the store. This process will be different than normal file system access because there is no existing file system for methods like Create_File to operate on. Thus, we must manually set up the initial structure. We won't know the location of the store header until we allocate it, so we have to allocate it before we set the pointer in the boot sector. Further, we don't want to give the store the appearance of being properly set up until we finish it, so we won't set the boot sector pointer until the end. These restrictions basically tell us that we have to set up the store backwards. That is, we have to write the file and directory contents before we write the headers for those files and directories. And we have to set up the root directory before we set up the store header.
Although some operating systems have the store's allocation table represented as a file on the store, UOS doesn't do so. Yes, it could, but it would be extra work with no benefit. We don't allow anyone, except the managed store, to have direct access to the table data. The rebuild process will simply use the Allocate_At method to rebuild the table. So there is no need for the storage overhead of a file header. This means that the store's allocation table is the only structure on the store that isn't accessible via a file, but this one "inconsistency" is something I'm willing to live with!
If you are perceptive, you may have already realized that we have a "chicken or the egg" scenario. That is, all files have names and those names are pointers into the string table, which is a file. How does Strings.sys have a name within itself before it even exists? This is another reason why we can't use normal File System methods to create, or access, these files.
We can use the TUOS_Native_File class to access files since it knows nothing about the file system - it just deals with the allocation chain on the store, and a few portions of the file header. The root file has no name, so we don't need to concern ourselves with that. We set the clustersize and various dates. Then we set the root folder flags to indicate that this is a system file that cannot be deleted (FAF_PERMANENT) and is marked as a directory. We don't write the header out yet - we'll do that in a bit. Next, we create the string table, string index, and string allocation table. We set up the TUOS_Native_File instances and write the hand-coded initial data for these files.
Now we create the BadBlock.sys file header, and assign the bad allocation clusters. Note that this required adding the Set_Allocation_Cluster method to the TUOS_Native_File class.
We create the Store folder header, and then we write the headers for the other files to the store folder file. Finally we write the Store folder header to the root directory and then write the root directory header.
At this point, we have the basic file structure set up. Next, we write out the store header, itself, "plugging in" our root folder pointer.
Next we write out the allocation table, and finally we update the boot cluster.
And that's it! The store is now initialized with a fresh UOS file system. Here is the new Set_Allocation_Cluster method in TUOS_Native_File.
In turn, this calls a new method in the allocation cluster manager, which is defined as follows:
In the next articles, we will write the methods that will allow us to navigate and modify our file system. |